PRODUCTION OF FIBRINOGEN IN
TRANSGENIC ANIMALS

BACKGROUND OF THE INVENTION 

The final step in the blood coagulation cascade is the
thrombin-catalyzed conversion of the soluble plasma protein
fibrinogen to insoluble fibrin. Thrombin cleaves a small
peptide (fibrinopeptide A) from one of the three component
chains (the Aα-chain) of fibrinogen. Fibrin monomers
subsequently polymerize and are cross-linked by activated
factor XIII to form a stable clot.

Fibrinogen is a key component of biological tissue glues
(see, e.g., U.S. Pat. Nos. 4,377,572 and 4,442,655), which
mimic the formation of natural blood clots to promote
hemostasis and repair damaged tissue. Tissue glues provide
an adjunct or alternative to sutures, staples and other
mechanical means for wound closure. However, the principal
ingredients of these products (fibrinogen, factor XIII and
thrombin) are prepared from pooled human plasma by
cryoprecipitation (e.g. U.S. Pat. Nos. 4,377,572; 4,362,567;
4,909,251) or ethanol precipitation (e.g. U.S. Pat. No.
4,442,655) or from single donor plasma (e.g. U.S. Pat. No.
4,627,879; Spotnitz et al., Am. Surg. 55: 166-168, 1989).
The resultant fibrinogen/factor XIII preparation is mixed
with bovine thrombin immediately before use to convert the
fibrinogen to fibrin and activate the factor XIII, thus
initiating coagulation of the adhesive.

Commercially available adhesives are of pooled plasma
origin. Because blood-derived products have been associated
with the transmission of human immunodeficiency virus (HIV),
hepatitis virus and other etiologic agents, the acceptance
and availability of such adhesives are limited. At present
they are not approved for use in the United States.

While the use of autologous plasma reduces the risk of
disease transmission, autologous adhesives can only be used
in elective surgery when the patient is able to donate the
necessary blood in advance.

As noted above, fibrinogen consists of three polypeptide
chains, each of which is present in two copies in the
assembled molecule. These chains, designated the Aα-, Bβ-
and γ-chains, are coordinately expressed, assembled and
secreted by the liver. While it might be expected that
recombinant DNA technology could provide an alternative
to the isolation of fibrinogen from plasma, this goal has
proven to be elusive. The three fibrinogen chains have been
individually expressed in E. coli (Lord, DNA 4: 33-38,
1985; Bolyard and Lord, Gene 66: 183-192, 1988; Bolyard
and Lord, Blood 73: 1202-1206), but functional fibrinogen
has not been produced in a prokaryotic system. Expression
of biologically competent fibrinogen in yeast has not been
reported. Cultured transfected mammalian cells have been
used to express biologically active fibrinogen (Farrell et al.,
Blood 74: 55a, 1989; Hartwig and Danishefsky, J. Biol.
Chem. 266: 6578-6585, 1991; Farrell et al., Biochemistry
30: 9414-9420, 1991), but expression levels have been so
low that production of recombinant fibrinogen in commercial
quantities is not feasible. Experimental evidence suggests
that lower transcription rates in cultured cells as
compared to liver may be a factor in the low expression rates
achieved to date, but increasing the amount of fibrinogen
chain mRNA in transfected BHK cells did not produce
corresponding increases in fibrinogen protein secretion
(Prunkard and Foster, XIV Congress of the International
Society on Thrombosis and Haemostasis, 1993). These latter
results suggest that proper assembly and processing of
fibrinogen involves tissue-specific mechanisms not present
in common laboratory cell lines.




There remains a need in the art for methods of producing
large quantities of high quality fibrinogen for use in tissue
adhesives and other applications. There is a further need for
fibrinogen that is free of blood-borne pathogens. The present
invention fulfills these needs and provides other, related
advantages.

SUMMARY OF THE INVENTION 

It is an object of the present invention to provide
commercially useful quantities of recombinant fibrinogen,
particularly recombinant human fibrinogen. It is a further
object of the invention to provide materials and methods for
expressing fibrinogen in the mammary tissue of transgenic
animals, particularly livestock animals such as cattle, sheep,
pigs and goats.

Within one aspect, the present invention provides a
method for producing fibrinogen comprising (a) providing a
first DNA segment encoding a secretion signal operably
linked to a fibrinogen Aα chain, a second DNA segment
encoding a secretion signal operably linked to a fibrinogen
Bβ chain, and a third DNA segment encoding a secretion
signal operably linked to a fibrinogen γ chain, wherein each
of the first, second and third segments is operably linked to
additional DNA segments required for its expression in the
mammary gland of a host female mammal; (b) introducing
the DNA segments into a fertilized egg of a non-human
mammalian species; (c) inserting the egg into an oviduct or
uterus of a female of the species to obtain offspring carrying
the DNA constructs; (d) breeding the offspring to produce
female progeny that express the first, second and third DNA
segments and produce milk containing biocompetent
fibrinogen encoded by the segments; (e) collecting milk
from the female progeny; and (f) recovering the fibrinogen
from the milk. Within one embodiment, the egg containing
the introduced segments is cultured for a period of time prior
to insertion.

Within another aspect, the invention provides a method of
producing fibrinogen comprising the steps of (a) incorporating
a first DNA segment encoding a secretion signal
operably linked to an Aα chain of fibrinogen into a
β-lactoglobulin gene to produce a first gene fusion; (b)
incorporating a second DNA segment encoding a secretion
signal operably linked to a Bβ chain of fibrinogen into a
β-lactoglobulin gene to produce a second gene fusion; (c)
incorporating a third DNA segment encoding a secretion
signal operably linked to a γ chain of fibrinogen into a
β-lactoglobulin gene to produce a third gene fusion; (d)
introducing the first, second and third gene fusions into the
germ line of a non-human mammal so that the DNA segments
are expressed in a mammary gland of the mammal or
its female progeny and biocompetent fibrinogen is secreted
into milk of the mammal or its female progeny; (e) obtaining
milk from the mammal or its female progeny; and (f)
recovering the fibrinogen from the milk. Within preferred
embodiments, the mammal is a sheep, pig, goat or bovine.

Within another aspect, the invention provides a method
for producing fibrinogen comprising the steps of (a) providing
a transgenic female non-human mammal carrying in its
germline heterologous DNA segments encoding Aα, Bβ and
γ chains of fibrinogen, wherein the DNA segments are
expressed in a mammary gland of the mammal and fibrinogen
encoded by the DNA segments is secreted into milk of
the mammal; (b) collecting milk from the mammal; and (c)
recovering the fibrinogen from the milk.

Within another aspect, the invention provides a non-human
mammalian embryo containing in its nucleus heterologous
DNA segments encoding Aα, Bβ and γ chains of
fibrinogen. Within a related aspect, the invention provides a
transgenic non-human female mammal that produces
recoverable amounts of human fibrinogen in its milk.

Within another aspect, the invention provides a method
for producing a transgenic offspring of a mammal comprising
the steps of (a) providing a first DNA segment encoding
a fibrinogen Aα chain, a second DNA segment encoding a
fibrinogen Bβ chain, and a third DNA segment encoding a
fibrinogen γ chain, wherein each of said first, second and
third segments is operably linked to additional DNA segments
required for its expression in a mammary gland of a
host female mammal and secretion into milk of the host
female mammal; (b) introducing the DNA segments into a
fertilized egg of a mammal of a non-human species; and (c)
inserting the egg into an oviduct or uterus of a female of the
non-human species to obtain an offspring carrying the first,
second and third DNA segments. In a related aspect, the
invention provides non-human mammals produced according
to this process.

Within an additional aspect, the invention provides a
non-human mammal carrying in its germline DNA segments
encoding heterologous Aα, Bβ and γ chains of fibrinogen,
wherein female progeny of the mammal express the DNA
segments in a mammary gland to produce biocompetent
fibrinogen.

These and other aspects of the invention will become 
evident to the skilled practitioner upon reference to the 
following detailed description and the attached drawings. 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 illustrates the subcloning of a human fibrinogen
Aα chain DNA sequence.

FIG. 2 is a partial restriction map of the vector Zem228.
Symbols used are MT-1p, mouse metallothionein promoter;
SV40t, SV40 terminator; and SV40p, SV40 promoter.

FIG. 3 illustrates the subcloning of a human fibrinogen Bβ
chain DNA sequence.

FIG. 4 illustrates the subcloning of a human fibrinogen γ
chain DNA sequence.

FIG. 5 is a partial restriction map of the vector Zem219b.
Symbols used are MT-1p, mouse metallothionein promoter;
hGHt, human growth hormone terminator; SV40p, SV40
promoter; DHFR, dihydrofolate reductase gene; and SV40t,
SV40 terminator.

DETAILED DESCRIPTION OF THE 
INVENTION 


Prior to setting forth the invention in detail, it will be 
helpful to define certain terms used herein: 

As used herein, the term "biocompetent fibrinogen" is
used to denote fibrinogen that polymerizes when treated
with thrombin to form insoluble fibrin.

The term "egg" is used to denote an unfertilized ovum, a 
fertilized ovum prior to fusion of the pronuclei or an early 
stage embryo (fertilized ovum with fused pronuclei). 

A "female mammal that produces milk containing biocompetent
fibrinogen" is one that, following pregnancy and
delivery, produces, during the lactation period, milk
containing recoverable amounts of biocompetent fibrinogen.
Those skilled in the art will recognize that such animals
will produce milk, and therefore the fibrinogen,
discontinuously.

The term "progeny" is used in its usual sense to include
children and descendants.

The term "heterologous" is used to denote genetic material
originating from a different species than that into which
it has been introduced, or a protein produced from such
genetic material.

Within the present invention, transgenic animal technology
is employed to produce fibrinogen within the mammary
glands of a host female mammal. Expression in the mammary
gland and subsequent secretion of the protein of
interest into the milk overcomes many difficulties encountered
in isolating proteins from other sources. Milk is readily
collected, available in large quantities, and well characterized
biochemically. Furthermore, the major milk proteins are
present in milk at high concentrations (from about 1 to 15
g/l).

From a commercial point of view, it is clearly preferable
to use as the host a species that has a large milk yield. While
smaller animals such as mice and rats can be used (and are
preferred at the proof-of-concept stage), within the present
invention it is preferred to use livestock mammals including,
but not limited to, pigs, goats, sheep and cattle. Sheep are
particularly preferred due to such factors as the previous
history of transgenesis in this species, milk yield, cost and
the ready availability of equipment for collecting sheep
milk. See WO 88/00239 for a comparison of factors influencing
the choice of host species. It is generally desirable to
select a breed of host animal that has been bred for dairy use,
such as East Friesland sheep, or to introduce dairy stock by
breeding of the transgenic line at a later date. In any event,
animals of known, good health status should be used.

Fibrinogen produced according to the present invention
may be human fibrinogen or fibrinogen of a non-human
animal. For medical uses, it is preferred to employ proteins
native to the patient. The present invention thus provides
fibrinogen for use in both human and veterinary medicine.
Cloned DNA molecules encoding the component chains of
human fibrinogen are disclosed by Rixon et al. (Biochem.
22: 3237, 1983), Chung et al. (Biochem. 22: 3244, 1983),
Chung et al. (Biochem. 22: 3250, 1983), Chung et al. (Adv.
Exp. Med. Biol. 281: 39-48, 1990) and Chung et al. (Ann.
NY Acad. Sci. 408: 449-456, 1983). Bovine fibrinogen
clones are disclosed by Brown et al. (Nuc. Acids Res. 17:
6397, 1989) and Chung et al. (Proc. Natl. Acad. Sci. USA 78:
1466-1470, 1981). Other mammalian fibrinogen clones are
disclosed by Murakawa et al. (Thromb. Haemost. 69:
351-360, 1993). Representative sequences of human Aα,
Bβ and γ chain genes are shown in SEQ ID NOS: 1, 3 and
5, respectively. Those skilled in the art will recognize that
allelic variants of these sequences will exist; that additional
variants can be generated by amino acid substitution,
deletion, or insertion; and that such variants are useful
within the present invention. In general, it is preferred that
any engineered variants comprise only a limited number of
amino acid substitutions, deletions, or insertions, and that
any substitutions are conservative. Thus, it is preferred to
produce fibrinogen chain polypeptides that are at least 90%,
preferably at least 95%, and more preferably 99% or more
identical in sequence to the corresponding native chains. The
term "γ chain" is meant to include the alternatively spliced
γ' chain of fibrinogen (Chung et al., Biochem. 23:
4232-4236, 1984). A human γ' chain amino acid sequence is
shown in SEQ ID NO: 6. The shorter γ chain is produced by
alternative splicing at nucleotides 9511 and 10054 of SEQ
ID NO: 5, resulting in translation terminating after
nucleotide 10065 of SEQ ID NO: 5.
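
The sequence-identity preference stated above can be illustrated with a short calculation. The following is only a sketch: the text does not specify how identity is computed, so the code assumes two sequences that have already been aligned to equal length, counts gap positions ("-") as mismatches, and uses invented toy sequences rather than any sequence from the SEQ ID listing. The function names are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity of two pre-aligned sequences of equal length.

    Assumption (not from the specification): identity is the number of
    matching, non-gap columns divided by the alignment length.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same length")
    if not aligned_a:
        raise ValueError("sequences must not be empty")
    matches = sum(
        1 for x, y in zip(aligned_a, aligned_b) if x == y and x != "-"
    )
    return 100.0 * matches / len(aligned_a)


def preference_tier(identity: float) -> str:
    """Map a percent identity onto the tiers named in the text."""
    if identity >= 99.0:
        return "more preferred (99% or more)"
    if identity >= 95.0:
        return "preferred (at least 95%)"
    if identity >= 90.0:
        return "acceptable (at least 90%)"
    return "below the stated range"


# Toy 10-residue example with a single substitution at the last position.
native = "ACDEFGHIKL"
variant = "ACDEFGHIKV"
print(percent_identity(native, variant), preference_tier(percent_identity(native, variant)))
```

A real comparison would first align the variant against the native chain with an alignment tool; the tiering step is independent of how the alignment is obtained.
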

To obtain expression in the mammary gland, a transcription
promoter from a milk protein gene is used. Milk protein
genes include those genes encoding caseins,
beta-lactoglobulin (BLG), α-lactalbumin, and whey acidic
protein. The beta-lactoglobulin promoter is preferred. In the
case of the ovine beta-lactoglobulin gene, a region of at least
the proximal 406 bp of 5' flanking sequence of the ovine
BLG gene (contained within nucleotides 3844 to 4257 of
SEQ ID NO: 7) will generally be used. Larger portions of the
5' flanking sequence, up to about 5 kbp, are preferred. A
larger DNA segment encompassing the 5' flanking promoter
region and the region encoding the 5' non-coding portion of
the beta-lactoglobulin gene (contained within nucleotides 1
to 4257 of SEQ ID NO: 7) is particularly preferred. See
Whitelaw et al., Biochem. J. 286: 31-39, 1992. Similar
fragments of promoter DNA from other species are also
suitable.

Other regions of the beta-lactoglobulin gene may also be
incorporated in constructs, as may genomic regions of the
gene to be expressed. It is generally accepted in the art that
constructs lacking introns, for example, express poorly in
comparison with those that contain such DNA sequences
(see Brinster et al., Proc. Natl. Acad. Sci. USA 85: 836-840,
1988; Palmiter et al., Proc. Natl. Acad. Sci. USA 88:
478-482, 1991; Whitelaw et al., Transgenic Res. 1: 3-13,
1991; WO 89/01343; WO 91/02318). In this regard, it is
generally preferred, where possible, to use genomic
sequences containing all or some of the native introns of a
gene encoding the protein or polypeptide of interest. Within
certain embodiments of the invention, the further inclusion
of at least some introns from the beta-lactoglobulin gene is
preferred. One such region is a DNA segment which provides
for intron splicing and RNA polyadenylation from the
3' non-coding region of the ovine beta-lactoglobulin gene.
When substituted for the natural 3' non-coding sequences of
a gene, this ovine beta-lactoglobulin segment can both
enhance and stabilize expression levels of the protein or
polypeptide of interest. Within other embodiments, the
region surrounding the initiation ATG of one or more of the
fibrinogen sequences is replaced with corresponding
sequences from a milk specific protein gene. Such replacement
provides a putative tissue-specific initiation environment
to enhance expression. It is convenient to replace the
entire fibrinogen chain pre-pro and 5' non-coding sequences
with those of, for example, the BLG gene, although smaller
regions may be replaced.

For expression of fibrinogen, DNA segments encoding
each of the three component polypeptide chains of fibrinogen
are operably linked to additional DNA segments
required for their expression to produce expression units.
Such additional segments include the above-mentioned milk
protein gene promoter, as well as sequences which provide
for termination of transcription and polyadenylation of
mRNA. The expression units will further include a DNA
segment encoding a secretion signal operably linked to the
segment encoding the fibrinogen polypeptide chain. The
secretion signal may be a native fibrinogen secretion signal
or may be that of another protein, such as a milk protein. The
term "secretion signal" is used herein to denote that portion
of a protein that directs it through the secretory pathway of
a cell to the outside. Secretion signals are most commonly
found at the amino-termini of proteins. See, for example,
von Heijne, Nuc. Acids Res. 14: 4683-4690, 1986; and
Meade et al., U.S. Pat. No. 4,873,316, which are incorporated
herein by reference.

Construction of expression units is conveniently carried
out by inserting a fibrinogen chain sequence into a plasmid
or phage vector containing the additional DNA segments,
although the expression unit may be constructed by essentially
any sequence of ligations. It is particularly convenient
to provide a vector containing a DNA segment encoding a
milk protein and to replace the coding sequence for the milk
protein with that of a fibrinogen chain (including a secretion
signal), thereby creating a gene fusion that includes the
expression control sequences of the milk protein gene. In
any event, cloning of the expression units in plasmids or
other vectors facilitates the amplification of the fibrinogen
sequences. Amplification is conveniently carried out in
bacterial (e.g. E. coli) host cells; thus the vectors will
typically include an origin of replication and a selectable
marker functional in bacterial host cells.

In view of the size of the fibrinogen chain genes it is most
practical to prepare three separate expression units, mix
them, and introduce the mixture into the host. However,
those skilled in the art will recognize that other protocols
may be followed. For example, expression units for the three
chains can be introduced individually into different embryos
to be combined later by breeding. In a third approach, the
three expression units can be linked in a single suitable
vector, such as a yeast artificial chromosome or phage P1
clone. Coding sequences for two or three chains can be
combined in polycistronic expression units (see, e.g.,
Levinson et al., U.S. Pat. No. 4,713,339).
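
The breeding route mentioned above, in which separately introduced expression units are combined by later crosses, can be sized with a simple probability sketch. The code assumes (this is an illustrative simplification, not a statement from the specification) that each expression unit has integrated at a single locus, that the three loci are unlinked, and that inheritance is Mendelian. The function names and genotype labels are hypothetical.

```python
from fractions import Fraction


def p_inherit(copies_parent1: int, copies_parent2: int) -> Fraction:
    """Probability that an offspring receives at least one copy of a locus.

    Each parent carries 0, 1 (hemizygous) or 2 (homozygous) copies.
    """
    for copies in (copies_parent1, copies_parent2):
        if copies not in (0, 1, 2):
            raise ValueError("copies must be 0, 1 or 2")
    # A parent with c copies passes on no copy with probability (2 - c) / 2.
    p_none = Fraction(2 - copies_parent1, 2) * Fraction(2 - copies_parent2, 2)
    return 1 - p_none


def p_all_units(parent1: dict, parent2: dict,
                units=("A_alpha", "B_beta", "gamma")) -> Fraction:
    """Probability that an offspring carries every listed expression unit,
    assuming the loci assort independently."""
    prob = Fraction(1)
    for unit in units:
        prob *= p_inherit(parent1.get(unit, 0), parent2.get(unit, 0))
    return prob


# A parent hemizygous for the A-alpha and B-beta units crossed with a
# parent hemizygous for the gamma unit.
print(p_all_units({"A_alpha": 1, "B_beta": 1}, {"gamma": 1}))  # 1/8
```

Under these assumptions only about one offspring in eight from such a cross would carry all three units, which is one reason co-introduction of a mixture is described as the most practical route.
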

The expression unit(s) is(are) then introduced into fertilized
eggs (including early-stage embryos) of the chosen host
species. Introduction of heterologous DNA can be accomplished
by one of several routes, including microinjection
(e.g. U.S. Pat. No. 4,873,191), retroviral infection (Jaenisch,
Science 240: 1468-1474, 1988) or site-directed integration
using embryonic stem (ES) cells (reviewed by Bradley et al.,
Bio/Technology 10: 534-539, 1992). The eggs are then
implanted into the oviducts or uteri of pseudopregnant
females and allowed to develop to term. Offspring carrying
the introduced DNA in their germ line can pass the DNA on
to their progeny in the normal, Mendelian fashion, allowing
the development of transgenic herds. General procedures for
producing transgenic animals are known in the art. See, for
example, Hogan et al., Manipulating the Mouse Embryo: A
Laboratory Manual, Cold Spring Harbor Laboratory, 1986;
Simons et al., Bio/Technology 6: 179-183, 1988; Wall et al.,
Biol. Reprod. 32: 645-651, 1985; Buhler et al.,
Bio/Technology 8: 140-143, 1990; Ebert et al., Bio/Technology 9:
835-838, 1991; Krimpenfort et al., Bio/Technology 9:
844-847, 1991; Wall et al., J. Cell. Biochem. 49: 113-120,
1992; and WIPO publications WO 88/00239, WO 90/05188,
WO 92/11757; and GB 87/00458, which are incorporated
herein by reference. Techniques for introducing foreign
DNA sequences into mammals and their germ cells were
originally developed in the mouse. See, e.g., Gordon et al.,
Proc. Natl. Acad. Sci. USA 77: 7380-7384, 1980; Gordon
and Ruddle, Science 214: 1244-1246, 1981; Palmiter and
Brinster, Cell 41: 343-345, 1985; Brinster et al., Proc. Natl.
Acad. Sci. USA 82: 4438-4442, 1985; and Hogan et al.
(ibid.). These techniques were subsequently adapted for use
with larger animals, including livestock species (see e.g.,
WIPO publications WO 88/00239, WO 90/05188, and WO
92/11757; and Simons et al., Bio/Technology 6: 179-183,
1988). To summarize, in the most efficient route used to date
in the generation of transgenic mice or livestock, several
hundred linear molecules of the DNA of interest are injected
into one of the pro-nuclei of a fertilized egg. Injection of
DNA into the cytoplasm of a zygote can also be employed.
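
To give a feel for what "several hundred linear molecules" means in practice, the following sketch converts a DNA concentration and injected volume into a copy number. All numeric inputs are hypothetical illustrations; the specification does not state a construct length, concentration or injection volume. The average mass of 650 g/mol per base pair is a commonly used approximation for double-stranded DNA, and the function name is invented for this example.

```python
AVOGADRO = 6.02214076e23   # molecules per mole
AVG_BP_MASS = 650.0        # g/mol per base pair (approximation for dsDNA)


def copies_injected(length_bp: float, conc_ng_per_ul: float,
                    volume_pl: float) -> float:
    """Estimate the number of linear DNA molecules in an injected volume.

    length_bp      -- length of the linear fragment in base pairs
    conc_ng_per_ul -- DNA concentration in ng per microliter
    volume_pl      -- injected volume in picoliters
    """
    if length_bp <= 0 or conc_ng_per_ul <= 0 or volume_pl <= 0:
        raise ValueError("all inputs must be positive")
    volume_ul = volume_pl * 1e-6                 # pl -> microliters
    mass_g = conc_ng_per_ul * volume_ul * 1e-9   # ng -> g
    moles = mass_g / (length_bp * AVG_BP_MASS)
    return moles * AVOGADRO


# Hypothetical case: a 10 kb fragment at 2 ng/ul, 2 pl injected.
print(round(copies_injected(10_000, 2.0, 2.0)))
```

With these illustrative inputs the estimate falls in the range of a few hundred molecules, consistent in order of magnitude with the figure quoted in the text.
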

It is preferred to obtain a balanced expression of each
fibrinogen chain to allow for efficient formation of the
mature protein. Ideally, the three expression units should be
on the same DNA molecule for introduction into eggs. This
approach, however, may generate technical problems at, for
example, the injection and manipulation stages. For
example, the size of fibrinogen expression units may necessitate
the use of yeast artificial chromosomes (YACs) or
phage P1 to amplify and manipulate the DNA prior to
injection. If this approach is followed, segments of DNA to
be injected, containing all three expression units, would be
very large, thus requiring modification of the injection
procedure using, for example, larger bore needles. In a
simpler approach, a mixture of each individual expression
unit is used. It is preferred to combine equimolar amounts of
the three expression units, although those skilled in the art
will recognize that this ratio may be varied to compensate
for the characteristics of a given expression unit. Some
expression, generally a reduced level, will be obtained when
lesser molar amounts of one or two chains are used, and
expression efficiencies can generally be expected to decline
in approximate proportion to the divergence from the preferred
equimolar ratio. In any event, it is preferred to use a
mixture having a ratio of Aα:Bβ:γ expression units in the
range of 0.5-1:0.5-1:0.5-1. When the ratio is varied from
equimolar, it is preferred to employ relatively more of the
Bβ expression unit. Alternatively, one or a mixture of two of
the expression units is introduced into individual eggs.
However, animals derived by this approach will express
only one or two fibrinogen chains. To generate an intact
fibrinogen molecule by this approach requires a subsequent
breeding program designed to combine all three expression
units in individuals of a group of animals.
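
Because expression units of different lengths have different masses per molecule, an equimolar mixture is not an equal-mass mixture. The sketch below shows one way to convert a chosen molar ratio into DNA masses and to check a ratio against the 0.5-1:0.5-1:0.5-1 range given above. The fragment lengths and total mass are hypothetical placeholders, and the function names are invented for illustration.

```python
def masses_for_ratio(lengths_bp: dict, molar_ratio: dict,
                     total_mass_ng: float) -> dict:
    """Split a total DNA mass among expression units for a given molar ratio.

    The mass of each unit is proportional to (length in bp) x (molar share),
    since longer fragments weigh more per molecule.
    """
    weights = {name: lengths_bp[name] * molar_ratio[name] for name in lengths_bp}
    total_weight = sum(weights.values())
    return {name: total_mass_ng * w / total_weight for name, w in weights.items()}


def ratio_in_preferred_range(molar_ratio: dict,
                             low: float = 0.5, high: float = 1.0) -> bool:
    """True if, after scaling the largest share to 1, every share lies in
    the 0.5-1 range stated in the text."""
    top = max(molar_ratio.values())
    return all(low <= share / top <= high for share in molar_ratio.values())


# Hypothetical fragment lengths (bp); not taken from the specification.
lengths = {"A_alpha": 10_000, "B_beta": 20_000, "gamma": 10_000}
equimolar = {"A_alpha": 1.0, "B_beta": 1.0, "gamma": 1.0}
print(masses_for_ratio(lengths, equimolar, total_mass_ng=40.0))
print(ratio_in_preferred_range({"A_alpha": 0.5, "B_beta": 1.0, "gamma": 0.5}))
```

In this toy case the longer B-beta fragment needs twice the mass of each of the other two to reach an equimolar mixture.
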

In general, female animals are superovulated by treatment
with follicle stimulating hormone, then mated. Fertilized
eggs are collected, and the heterologous DNA is injected
into the eggs using known methods. See, for example, U.S.
Pat. No. 4,873,191; Gordon et al., Proc. Natl. Acad. Sci. USA
77: 7380-7384, 1980; Gordon and Ruddle, Science 214:
1244-1246, 1981; Palmiter and Brinster, Cell 41: 343-345,
1985; Brinster et al., Proc. Natl. Acad. Sci. USA 82:
4438-4442, 1985; Hogan et al., Manipulating the Mouse
Embryo: A Laboratory Manual, Cold Spring Harbor
Laboratory, 1986; Simons et al., Bio/Technology 6: 179-183,
1988; Wall et al., Biol. Reprod. 32: 645-651, 1985; Buhler
et al., Bio/Technology 8: 140-143, 1990; Ebert et al.,
Bio/Technology 9: 835-838, 1991; Krimpenfort et al.,
Bio/Technology 9: 844-847, 1991; Wall et al., J. Cell. Biochem.
49: 113-120, 1992; WIPO publications WO 88/00239, WO
90/05188, and WO 92/11757; and GB 87/00458, which are
incorporated herein by reference.

For injection into fertilized eggs, the expression units are
removed from their respective vectors by digestion with
appropriate restriction enzymes. For convenience, it is preferred
to design the vectors so that the expression units are
removed by cleavage with enzymes that do not cut either
within the expression units or elsewhere in the vectors. The
expression units are recovered by conventional methods,
such as electro-elution followed by phenol extraction and
ethanol precipitation, sucrose density gradient
centrifugation, or combinations of these approaches.
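
The design preference above, that excision enzymes should not cut within the expression unit, lends itself to a simple sequence check. The sketch below scans a sequence for the recognition sites of a few restriction enzymes and lists those with no site in it. The recognition sequences are the standard ones for these enzymes; the toy sequence and function names are invented for illustration, and a real check would use the full expression-unit and vector sequences.

```python
# Standard recognition sequences (all palindromic, so scanning one
# strand is sufficient).
RECOGNITION_SITES = {
    "Eco RI": "GAATTC",
    "Eco RV": "GATATC",
    "Hind III": "AAGCTT",
    "Mlu I": "ACGCGT",
    "Pvu I": "CGATCG",
    "Xba I": "TCTAGA",
}


def count_sites(sequence: str, site: str) -> int:
    """Count occurrences (overlapping allowed) of a site in a sequence."""
    seq = sequence.upper()
    count = 0
    start = seq.find(site)
    while start != -1:
        count += 1
        start = seq.find(site, start + 1)
    return count


def enzymes_not_cutting(sequence: str, sites: dict = RECOGNITION_SITES) -> list:
    """Names of enzymes with no recognition site in the sequence, i.e.
    candidates for excising the unit intact."""
    return sorted(name for name, site in sites.items()
                  if count_sites(sequence, site) == 0)


# Toy insert containing Eco RI, Xba I and Hind III sites only.
toy_insert = "ATGGAATTCCCGGGTCTAGAAAGCTTTAA"
print(enzymes_not_cutting(toy_insert))  # ['Eco RV', 'Mlu I', 'Pvu I']
```

Enzymes returned by the check are candidates for flanking sites; the same scan applied to the vector backbone would address the second half of the stated preference.
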

DNA is injected into eggs essentially as described in
Hogan et al., ibid. In a typical injection, eggs in a dish of an
embryo culture medium are located using a stereo zoom
microscope (x50 or x63 magnification preferred). Suitable
media include Hepes (N-2-hydroxyethylpiperazine-N'-2-
ethanesulphonic acid) or bicarbonate buffered media such as
M2 or M16 (available from Sigma Chemical Co., St. Louis,
USA) or synthetic oviduct medium (disclosed below). The
eggs are secured and transferred to the center of a glass slide
on an injection rig using, for example, a Drummond pipette
complete with capillary tube. Viewing at lower (e.g. x4)
magnification is used at this stage. Using the holding pipette
of the injection rig, the eggs are positioned centrally on the
slide. Individual eggs are sequentially secured to the holding
pipette for injection. For each injection process, the holding
pipette/egg is positioned in the center of the viewing field.
The injection needle is then positioned directly below the
egg. Preferably using x40 Nomarski objectives, both
manipulator heights are adjusted to focus both the egg and
the needle. The pronuclei are located by rotating the egg and
adjusting the holding pipette assembly as necessary. Once
the pronucleus has been located, the height of the manipulator
is altered to focus the pronuclear membrane. The
injection needle is positioned below the egg such that the
needle tip is in a position below the center of the pronucleus.
The position of the needle is then altered using the injection
manipulator assembly to bring the needle and the pronucleus
into the same focal plane. The needle is moved, via the
joystick on the injection manipulator assembly, to a position to
the right of the egg. With a short, continuous jabbing
movement, the pronuclear membrane is pierced to leave the
needle tip inside the pronucleus. Pressure is applied to the
injection needle via the glass syringe until the pronucleus
swells to approximately twice its volume. At this point, the
needle is slowly removed. Reverting to lower (e.g. x4)
magnification, the injected egg is moved to a different area
of the slide, and the process is repeated with another egg.

After the DNA is injected, the eggs may be cultured to
allow the pronuclei to fuse, producing one-cell or later stage
embryos. In general, the eggs are cultured at approximately
the body temperature of the species used in a buffered
medium containing balanced salts and serum. Surviving
embryos are then transferred to pseudopregnant recipient
females, typically by inserting them into the oviduct or
uterus, and allowed to develop to term. During
embryogenesis, the injected DNA integrates in a random
fashion in the genomes of a small number of the developing
embryos.

Potential transgenic offspring are screened via blood
samples and/or tissue biopsies. DNA is prepared from these
samples and examined for the presence of the injected
construct by techniques such as polymerase chain reaction
(PCR; see Mullis, U.S. Pat. No. 4,683,202) and Southern
blotting (Southern, J. Mol. Biol. 98: 503, 1975; Maniatis et
al., Molecular Cloning: A Laboratory Manual, Cold Spring
Harbor Laboratory, 1982). Founder transgenic animals, or
G0s, may be wholly transgenic, having transgenes in all of
their cells, or mosaic, having transgenes in only a subset of
cells (see, for example, Wilkie et al., Develop. Biol. 118:
9-18, 1986). In the latter case, groups of germ cells may be
wholly or partially transgenic. Where the germ cells are only
partially transgenic, the number of transgenic progeny from
a founder animal will be less than the 50% predicted from
Mendelian principles.
Founder G0 animals are grown to sexual maturity and mated
to obtain offspring, or G1s. The G1s are also examined for
the presence of the transgene to demonstrate transmission
from founder G0 animals. In the case of male G0s, these
may be mated with several non-transgenic females to generate
many offspring. This increases the chances of observing
transgene transmission. Female G0 founders may be
mated naturally, artificially inseminated or superovulated to
obtain many eggs which are transferred to surrogate mothers.
The latter course gives the best chance of observing
transmission in animals having a limited number of young.
The above-described breeding procedures are used to obtain
animals that can pass the DNA on to subsequent generations
of offspring in the normal, Mendelian fashion, allowing the
development of, for example, colonies (mice), flocks
(sheep), or herds (pigs, goats and cattle) of transgenic
animals.
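
The effect of germline mosaicism on transmission, and the value of generating many offspring from a founder, can be put in rough numbers. The sketch below assumes a single hemizygous integration site and treats each offspring as an independent trial; both are simplifying assumptions made for illustration, and the function names are hypothetical.

```python
import math


def expected_transmission(germline_fraction: float) -> float:
    """Expected fraction of transgenic progeny from a founder in which the
    given fraction of germ cells carries one hemizygous copy of the transgene."""
    if not 0.0 <= germline_fraction <= 1.0:
        raise ValueError("germline_fraction must be between 0 and 1")
    return 0.5 * germline_fraction


def offspring_needed(p_transgenic: float, confidence: float = 0.95) -> int:
    """Smallest number of offspring giving at least the stated probability
    of observing one or more transgenic animals."""
    if not 0.0 < p_transgenic < 1.0:
        raise ValueError("p_transgenic must be strictly between 0 and 1")
    if not 0.0 < confidence < 1.0:
        raise ValueError("confidence must be strictly between 0 and 1")
    return math.ceil(math.log(1.0 - confidence) / math.log(1.0 - p_transgenic))


# Fully transgenic germline versus a mosaic founder with 20% transgenic germ cells.
for fraction in (1.0, 0.2):
    p = expected_transmission(fraction)
    print(fraction, p, offspring_needed(p))
```

Under these assumptions a non-mosaic founder needs only a handful of offspring to show transmission, whereas a founder with a largely non-transgenic germline needs several times as many, which is consistent with the preference for mating male founders to several females.
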

The milk from lactating G0 and G1 females is examined
for the expression of the heterologous protein using
immunological techniques such as ELISA (see Harlow and Lane,
Antibodies, A Laboratory Manual, Cold Spring Harbor
Laboratory, 1988) and Western blotting (Towbin et al., Proc.
Natl. Acad. Sci. USA 76: 4350-4354, 1979). For a variety of
reasons known in the art, expression levels of the heterologous
protein will be expected to differ between individuals.

A satisfactory family of animals should satisfy three
criteria: they should be derived from the same founder G0
animal; they should exhibit stable transmission of the transgene;
and they should exhibit stable expression levels from
generation to generation and from lactation to lactation of
individual animals. These principles have been demonstrated
and discussed (Carver et al., Bio/Technology 11:
1263-1270, 1993). Animals from such a suitable family are
referred to as a "line." Initially, male animals, G0 or G1, are
used to derive a flock or herd of producer animals by natural
or artificial insemination. In this way, many female animals
containing the same transgene integration event can be
quickly generated from which a supply of milk can be
obtained.

The fibrinogen is recovered from milk using standard
practices such as skimming, precipitation, filtration and
protein chromatography techniques.

Fibrinogen produced according to the present invention is useful
within human and veterinary medicine, such as in the formulation of
surgical adhesives. Adhesives of this type are known in the art. See,
for example, U.S. Pat. Nos. 4,377,572; 4,442,655; 4,462,567; and
4,627,879, which are incorporated herein by reference. In general,
fibrinogen and factor XIII are combined to form a first component that
is mixed just prior to use with a second component containing
thrombin. The thrombin converts the fibrinogen to fibrin, causing the
mixture to gel, and activates the factor XIII. The activated factor
XIII cross links the fibrin to strengthen and stabilize the adhesive
matrix. Such adhesives typically contain from about 30 mg/ml to about
100 mg/ml fibrinogen and from about 50 µg/ml to about 500 µg/ml factor
XIII. They may also contain additional ingredients, such as aprotinin,
albumin, fibronectin, bulking agents, and solubilizers. Methods for
producing factor XIII are known in the art. See, for example, U.S.
Pat. No. 5,204,447. The fibrinogen is also useful for coating surfaces
of polymeric articles, e.g. synthetic vascular grafts, as disclosed in
U.S. Pat. No. 5,272,074 (incorporated herein by reference).

The invention is further illustrated by the following 
non-limiting examples. 

EXAMPLES

Example I

The multiple cloning site of the vector pUC18 (Yanisch-Perron et al.,
Gene 33:103-119, 1985) was removed and replaced with a synthetic
double stranded oligonucleotide (the strands of which are shown in SEQ
ID NO: 8 and SEQ ID NO: 27) containing the restriction sites Pvu I/Mlu
I/Eco RV/Xba I/Pvu I/Mlu I, and flanked by 5' overhangs compatible
with the restriction sites Eco RI and Hind III. pUC18 was cleaved with
both Eco RI and Hind III, the 5' terminal phosphate groups were
removed with calf intestinal phosphatase, and the oligonucleotide was
ligated into the vector backbone. The DNA sequence across the junction
was confirmed by sequencing, and the new plasmid was called pUCPM.

The β-lactoglobulin (BLG) gene sequences from pSS1tgXS (disclosed in
WIPO publication WO 88/00239) were excised as a Sal I-Xba I fragment
and recloned into the vector pUCPM that had been cut with Sal I and
Xba I to construct vector pUCXS. pUCXS is thus a pUC18 derivative
containing the entire BLG gene from the Sal I site to the Xba I site
of phage SS1 (Ali and Clark, J. Mol. Biol. 199: 415-426, 1988).

The plasmid pSS1tgSE (disclosed in WIPO publication WO 88/00239)
contains a 1290 bp BLG fragment flanked by Sph I and EcoR I
restriction sites, a region spanning a unique Not I site and a single
Pvu II site which lies in the 5' untranslated leader of the BLG mRNA.
Into this Pvu II site was ligated a double stranded, 8 bp DNA linker
(5'-GGATATCC-3') encoding the recognition site for the enzyme Eco RV.
This plasmid was called pSS1tgSE/RV. DNA sequences bounded by Sph I
and Not I restriction sites in pSS1tgSE/RV were excised by enzymatic
digestion and used to replace the equivalent fragment in pUCXS. The
resulting plasmid was called pUCXSRV. The sequence of the BLG insert
in pUCXSRV is shown in SEQ ID NO: 7, with the unique Eco RV site at
nucleotide 4245 in the 5' untranslated leader region of the BLG gene.
This site allows insertion of any additional DNA sequences under the
control of the BLG promoter 3' to the transcription initiation site.

Using the primers BLGAMP3 (5'-TGG ATC CCC TGC CGG TGC CTC TGG-3'; SEQ
ID NO: 9) and BLGAMP4 (5'-AAC GCG TCA TCC TCT GTG AGC CAG-3'; SEQ ID
NO: 10) a PCR fragment of approximately 650 bp was produced from
sequences immediately 3' to the stop codon of the BLG gene in pUCXSRV.
The PCR fragment was engineered to have a BamH I site at its 5' end
and an Mlu I site at its 3' end and was cloned as such into BamH I and
Mlu I cut pGEM7zf(+) (Promega) to give pDAM200(+).
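
The engineered restriction sites can be checked directly against the oligonucleotide sequences quoted above. The following is a minimal sketch, not part of the original procedure: it assumes the standard recognition sequences for BamH I (GGATCC), Mlu I (ACGCGT) and Eco RV (GATATC), and the helper name `find_sites` is hypothetical.

```python
# Standard recognition sequences (assumed here, not quoted in the text).
SITES = {"BamH I": "GGATCC", "Mlu I": "ACGCGT", "Eco RV": "GATATC"}

# Oligonucleotides copied from the text, with the spaces removed.
OLIGOS = {
    "BLGAMP3": "TGGATCCCCTGCCGGTGCCTCTGG",
    "BLGAMP4": "AACGCGTCATCCTCTGTGAGCCAG",
    "Eco RV linker": "GGATATCC",
}


def find_sites(seq):
    """Return {enzyme: 0-based offset} for every site present in seq."""
    return {name: seq.find(site) for name, site in SITES.items() if site in seq}


for name, seq in OLIGOS.items():
    print(f"{name} ({len(seq)} nt): {find_sites(seq)}")
```

Each oligonucleotide carries exactly one of the three sites, one base in from its 5' end. Because BLGAMP4 is the reverse primer, its Mlu I site ends up at the 3' end of the amplified fragment.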

pUCXSRV was digested with Kpn I, and the largest, vector containing
band was gel purified. This band contained the entire pUC plasmid
sequences and some 3' non-coding sequences from the BLG gene. Into
this backbone was ligated the small Kpn I fragment from pDAM200(+)
which, in the correct orientation, effectively engineered a BamH I
site at the extreme 5' end of the 2.6 Kbp of the BLG 3' flanking
region. This plasmid was called pBLAC200. A 2.6 Kbp Cla I-Xba I
fragment from pBLAC200 was ligated into Cla I-Xba I cut pSP72 vector
(Promega), thus placing an EcoR V site immediately upstream of the BLG
sequences. This plasmid was called pBLAC210.

The 2.6 Kbp Eco RV-Xba I fragment from pBLAC210 was ligated into Eco
RV-Xba I cut pUCXSRV to form pMAD6. This, in effect, excised all
coding and intron sequences from pUCXSRV, forming a BLG minigene
consisting of 4.3 Kbp of 5' promoter and 2.6 Kbp of 3' downstream
sequences flanking a unique EcoR V site. An oligonucleotide linker
(ZC6839: ACTACGTAGT; SEQ ID NO: 11) was inserted into the Eco RV site
of pMAD6. This modification destroyed the Eco RV site and created a
Sna BI site to be used for cloning purposes. The vector was designated
pMAD6-Sna. Messenger RNA initiates upstream of the Sna BI site and
terminates downstream of the Sna BI site. The precursor transcript
will encode a single BLG-derived intron, intron 6, which is entirely
within the 3' untranslated region of the gene.
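
The effect of the ZC6839 linker on the Eco RV site can be illustrated with a short sketch. It models only the six-base recognition site, not the flanking pMAD6 sequence, and assumes the standard sites for Eco RV (GATATC, cut blunt between GAT and ATC) and Sna BI (TACGTA); the function names are hypothetical.

```python
ECO_RV = "GATATC"        # assumed standard site, blunt cut after GAT
SNA_BI = "TACGTA"        # assumed standard site
LINKER = "ACTACGTAGT"    # ZC6839, as given in the text

COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def insert_linker(site, linker, cut=3):
    """Insert a blunt linker at the cut position of a blunt-cut site."""
    return site[:cut] + linker + site[cut:]


junction = insert_linker(ECO_RV, LINKER)
print("junction:", junction)
print("Eco RV site retained:", ECO_RV in junction)
print("Sna BI site created:", SNA_BI in junction)
print("linker is palindromic:", reverse_complement(LINKER) == LINKER)
```

Because the linker reads the same on both strands, it gives the same junction in either orientation, so the Sna BI site is created however the linker ligates.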

Example II 

Clones encoding the individual fibrinogen chains were obtained from
the laboratory of Dr. Earl W. Davie, University of Washington,
Seattle. A genomic fibrinogen Aα-chain clone (Chung et al., 1990,
ibid.) was obtained from the plasmid BS4. This plasmid contains the Aα
clone inserted into the Sal I and Bam HI sites of the vector pUC18,
but lacks the coding sequence for the first four amino acids of the Aα
chain. A genomic Bβ-chain DNA (Chung et al., ibid.) was isolated from
a lambda Charon 4A phage clone (designated βλ4) as two EcoRI fragments
of ca. 5.6 Kbp each. The two fragments were cloned separately into
pUC19 that had been digested with Eco RI and treated with calf
intestinal phosphatase. The resulting clones were screened by
digestion with the restriction enzyme Pvu II to distinguish plasmids
with the 5' and 3' Bβ inserts (designated Beta5'RI/puc and
Beta3'RI/puc, respectively). Genomic γ-chain clones were isolated as
described by Rixon et al. (Biochemistry 24: 2077-2086, 1985). Clone
pγ12A9 comprises 5' non-coding sequences and approximately 4535 bp of
γ-chain coding sequence. Clone pγ12F3 comprises the remaining coding
sequence and 3' non-coding nucleotides. Both are pBR322-based plasmids
with the fibrinogen sequences inserted at the EcoRI site. These
plasmids were used as templates for the respective PCR reactions.

The fibrinogen chain coding sequences were tailored for insertion into
expression vectors using the polymerase chain reaction (PCR) as
generally described by Mullis (U.S. Pat. No. 4,683,202). This
procedure removed native 5' and 3' untranslated sequences, added a 9
base sequence (CCT GCA GCC) upstream of the first ATG of each coding
sequence, supplied the first four codons for the Aα-chain sequence,
removed an internal Mlu I site in the Aα sequence and added
restriction sites to facilitate subsequent cloning steps.

Referring to FIG. 1, the 5' end of the Aα coding sequence was tailored
in a PCR reaction containing 20 pmole for each of primers ZC6632 (SEQ
ID NO: 12) and ZC6627 (SEQ ID NO: 13), approximately 10 ng of plasmid
BS4 template DNA, 10 µl of a mix containing 2.5 mM each dNTP, 7.5 µl
10x Pyrococcus furiosus (Pfu) DNA polymerase buffer #1 (200 mM
Tris-HCl, pH 8.2, 100 mM KCl, 60 mM (NH4)2SO4, 20 mM MgCl2, 1% Triton
X-100, 100 µg/ml nuclease free bovine serum albumin) (Stratagene, La
Jolla, Calif.), and water to 75 µl. The mixture was heated to 94° C.
in a DNA thermal cycler (Perkin-Elmer Corp., Norwalk, Conn.). To the
heated mixture was added 25 µl of a mixture containing 2.5 µl 10x Pfu
buffer #1, 22 µl H2O and 1 µl 2.5 units/µl Pfu DNA polymerase
(Stratagene). The reactions were run in a DNA thermal cycler
(Perkin-Elmer) for five cycles of 94°, 45 seconds; 40°, 90 seconds;
72°, 120 seconds; 20 cycles of 94°, 45 seconds; 45°, 90 seconds; 72°,
120 seconds; then incubated at 72° for 7 minutes. The 5' PCR-generated
fragment was digested with Bam HI and Hind III, and the Bam HI-Hind
III fragment was then ligated to an internal 2.91 Kbp Hind III-Xba I
fragment and Bam HI, Xba I-digested pUC18. PCR-generated exon
sequences were sequenced.

Referring again to FIG. 1, the 3' end of the Aα coding sequence was
tailored in a series of steps in which the Mlu I site 563 bases
upstream from the stop codon of the Aα sequence was mutated using an
overlap extension PCR reaction (Ho et al., Gene 77: 51-59, 1989). In
the first reaction 40 pmole of each of primers ZC6521 (SEQ ID NO: 14)
and ZC6520 (SEQ ID NO: 15) were combined with approximately 10 ng of
plasmid BS4 template DNA in a reaction mixture as described above. The
reaction was run for 5 cycles of 94°, 45 seconds; 40°, 60 seconds;
72°, 120 seconds; 15 cycles of 94°, 45 seconds; 45°, 60 seconds; 72°,
120 seconds; then incubated at 72° for 7 minutes. A second reaction
was carried out in the same manner using 40 pmole of each of primers
ZC6519 (SEQ ID NO: 16) and ZC6518 (SEQ ID NO: 17) and BS4 as template.
The PCR-generated DNA fragments from the first and second reactions
were isolated by gel electrophoresis and elution from the gel.
Approximately 1/10 of each recovered reaction product was combined
with 40 pmole of each of primers ZC6521 (SEQ ID NO: 14) and ZC6518
(SEQ ID NO: 17) in a PCR reaction in which the complementary 3' ends
of each fragment (containing the single base change) annealed and
served as a primer for the 3' extension of the complementary strand.
PCR was carried out using the same reaction conditions as in the first
and second 3' PCR steps. The reaction product was then digested with
Xba I and Bam HI, and the Xba I-Bam HI fragment was cloned into Xba I,
Bam HI-digested pUC18. PCR-generated exons were sequenced.
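
The logic of the overlap extension step can be sketched on a toy sequence. Everything below is illustrative: the template is an invented 34-base string, not the Aα sequence; the particular base change (ACGCGT to ACGCAT) is an assumption, since the text does not state which base was altered; and the fragments are produced by slicing rather than by simulating primer annealing.

```python
MLU_I = "ACGCGT"  # assumed standard Mlu I recognition sequence


def mutate(seq, pos, base):
    """Return seq with the base at 0-based position pos replaced."""
    return seq[:pos] + base + seq[pos + 1:]


def fuse_by_overlap(left, right, min_overlap=8):
    """Join two fragments whose ends share an identical overlap."""
    for k in range(min(len(left), len(right)), min_overlap - 1, -1):
        if left.endswith(right[:k]):
            return left + right[k:]
    raise ValueError("fragments do not share a sufficient overlap")


# Toy template with one Mlu I site (hypothetical sequence).
template = "ATGGCTAGCTTAGG" + MLU_I + "CCATTGGAGTCTAA"
pos = template.index(MLU_I) + 4          # second G of ACGCGT
mutant = mutate(template, pos, "A")      # ACGCGT -> ACGCAT

# First and second reactions: each product carries the single base
# change within the end that overlaps the other product.
fragment_1 = mutant[:pos + 6]
fragment_2 = mutant[pos - 6:]

# Third reaction: the overlapping ends anneal and are extended.
product = fuse_by_overlap(fragment_1, fragment_2)
print("Mlu I site in template:", MLU_I in template)
print("Mlu I site in product: ", MLU_I in product)
```

The fused product has the same length as the template and differs from it at one position, which is the point of the method: the site is removed without disturbing the surrounding sequence.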

As shown in FIG. 1, the 5' Bam HI-Xba I fragment (3.9 Kbp) and the 3'
Xba I-Bam HI fragment (1.3 Kbp) were inserted into the Bam HI site of
the vector Zem228. Zem228 is a pUC18 derivative comprising a Bam HI
cloning site between a mouse MT-1 promoter and SV40 terminator, and a
neomycin resistance marker flanked by SV40 promoter and terminator
sequences. See European Patent Office Publication EP 319,944 and FIG.
2. The entire Aα coding sequence was isolated from the Zem228 vector
as an Sna BI fragment, which was inserted into the Sna BI site of the
plasmid pMAD6-Sna.

Referring to FIG. 3, the 5' end of the Bβ-chain was tailored by PCR
using the oligonucleotides ZC6629 (SEQ ID NO: 18), ZC6630 (SEQ ID NO:
19) and ZC6625 (SEQ ID NO: 20). These primers were used in pairwise
combinations (ZC6629+ZC6625 or ZC6630+ZC6625) to generate Bβ coding
sequences beginning at the first ATG codon (position 470 in SEQ ID NO:
3) (designated N1-Beta) or the third ATG codon (position 512 in SEQ ID
NO: 3) (designated N3-Beta). Approximately 5 ng of Beta5'RI/puc
template DNA was combined with 20 pmole of each of the primers
(N1-Beta: ZC6629, SEQ ID NO: 18+ZC6625, SEQ ID NO: 20; or N3-Beta:
ZC6630, SEQ ID NO: 19+ZC6625, SEQ ID NO: 20) in a reaction mixture as
described above. The mixtures were incubated for 5 cycles of 94°, 45
seconds; 40°, 120 seconds (N1-Beta) or 90 seconds (N3-Beta); 72°, 120
seconds; 20 cycles of 94°, 45 seconds; 45°, 120 seconds (N1-Beta) or
90 seconds (N3-Beta); 72°, 120 seconds; then incubated at 72° for 7
minutes. The two reaction products (N1, 555 bp or N3, 510 bp) were
each digested with Eco RI and Bgl II, and the fragments were ligated
to the internal Bgl II-Xba I fragment and Eco RI+Xba I-digested pUC19.
The 3' end of the Bβ sequence was tailored in a reaction mixture as
described above using the oligonucleotide primers ZC6626 (SEQ ID NO:
21) and ZC6624 (SEQ ID NO: 22) and approximately 5 ng of Beta3'RI/puc
template. The mixtures were incubated for 5 cycles of 94°, 45 seconds;
40°, 90 seconds; 72°, 120 seconds; 15 cycles of 94°, 45 seconds; 45°,
90 seconds; 72°, 120 seconds; then incubated at 72° for 7 minutes. A
990 bp Bgl II-Eco RI fragment was isolated. This 3' fragment was
ligated to the adjacent coding fragment (340 bp, Sph I-Bgl II) and Sph
I+Eco RI-digested pUC19. The 3' and 5' PCR-generated exons were
sequenced. A third intermediate vector was constructed by combining
two internal fragments (4285 bp Xba I-Eco RI and 383 bp Eco RI-Sph I)
in Xba I+Sph I-digested pUC19. The entire Bβ coding sequence (two
forms) was then assembled by ligating one of the 5' Eco RI-Xba I
fragments, the internal Xba I-Sph I fragment, the 3' Sph I-Eco RI
fragment and Eco RI-digested vector pUC19. The Bβ sequence was then
isolated as a 7.6 Kbp Sna BI fragment and inserted into the Sna BI
site of pMAD6-Sna.

Referring to FIG. 4, the 5' end of the gamma chain sequence was
tailored by PCR using the oligonucleotide primers ZC6514 (SEQ ID NO:
23) and ZC6517 (SEQ ID NO: 24) and approximately 50 ng of pγ12A9 as
template. The PCR reaction was run as described above using 40 pM of
each primer. The reaction was run for 5 cycles of 94°, 45 seconds;
40°, 60 seconds; 72°, 120 seconds, followed by 15 cycles of 94°, 45
seconds; 45°, 60 seconds; 72°, 120 seconds. The resulting 213 bp
fragment was digested with Bam HI and Spe I, and the resulting
restriction fragment was ligated with the adjacent downstream 4.4 kb
Spe I-Eco RI fragment and Bam HI+Eco RI digested pUC19. The 3' end of
the gamma chain sequence was tailored using oligonucleotide primers
ZC6516 (SEQ ID NO: 25) and ZC6515 (SEQ ID NO: 26) using 40 pM of each
primer, approximately 50 ng of pγ12F3 template and the same thermal
cycling schedule as used for the 5' fragment. The resulting 500 bp
fragment was digested with Spe I and Bam HI, and the resulting
restriction fragment was ligated with the upstream 2.77 kb Eco RI-Spe
I fragment and Eco RI+Bam HI-digested pUC19. All PCR-generated exons
were sequenced. The entire γ-chain coding sequence was then assembled
by ligating a 4.5 Kbp Bam HI-Eco RI 5' fragment, a 1.1 Kbp Eco RI-Pst
I internal fragment and a 2.14 Kbp Pst I-Xba I 3' fragment in Bam
HI+Xba I-digested Zem219b. Zem219b is a pUC18-derived vector
containing a mouse metallothionein promoter and a DHFR selectable
marker operably linked to an SV40 promoter (FIG. 5). Plasmid Zem219b
has been deposited with American Type Culture Collection as an E. coli
XL1-blue transformant under Accession No. 68979. The entire γ-chain
coding sequence was then isolated as a 7.8 Kbp Sna BI fragment and
inserted into the Sna BI site of pMAD6-Sna.

Example III

Mice for initial breeding stocks (C57BL6J, CBACA) were obtained from
Harlan Olac Ltd. (Bicester, UK). These were mated in pairs to produce
F1 hybrid cross (B6CBAF1) for recipient female, superovulated females,
stud males and vasectomized males. All animals were kept on a 14 hour
light/10 hour dark cycle and fed water and food (Special Diet Services
RM3, Edinburgh, Scotland) ad libitum.

Transgenic mice were generated essentially as described in Hogan et
al., Manipulating the Mouse Embryo: A Laboratory Manual, Cold Spring
Harbor Laboratory, 1986, which is incorporated herein by reference in
its entirety. Female B6CBAF1 animals were superovulated at 4-5 weeks
of age by an i.p. injection of pregnant mares' serum gonadotrophin
(FOLLIGON, Vet-Drug, Falkirk, Scotland) (5 iu) followed by an i.p.
injection of human chorionic gonadotrophin (CHORULON, Vet-Drug,
Falkirk, Scotland) (5 iu) 45 hours later. They were then mated with a
stud male overnight. Such females were next examined for copulation
plugs. Those that had mated were sacrificed, and their eggs were
collected for microinjection.

DNA was injected into the fertilized eggs as described in Hogan et al.
(ibid.). Briefly, each of the vectors containing the Aα, Bβ and γ
expression units was digested with Mlu I, and the expression units
were isolated by sucrose gradient centrifugation. All chemicals used
were reagent grade (Sigma Chemical Co., St. Louis, Mo., U.S.A.), and
all solutions were sterile and nuclease-free. Solutions of 20% and 40%
sucrose in 1M NaCl, 20 mM Tris pH 8.0, 5 mM EDTA were prepared using
UHP water and filter sterilized. A 30% sucrose solution was prepared
by mixing equal volumes of the 20% and 40% solutions. A gradient was
prepared by layering 0.5 ml steps of the 40%, 30% and 20% sucrose
solutions into a 2 ml polyallomer tube and allowed to stand for one
hour. 100 µl of DNA solution (max. 8 µg DNA) was loaded onto the top
of the gradient, and the gradient was centrifuged for 17-20 hours at
26,000 rpm, 15° C. in a Beckman TL100 ultracentrifuge using a TLS-55
rotor (Beckman Instruments, Fullerton, Calif., USA). Gradients were
fractionated by puncturing the tube bottom with a 20 ga. needle and
collecting drops in a 96 well microtiter plate. 3 µl aliquots were
analyzed on a 1% agarose mini-gel. Fractions containing the desired
DNA fragment were pooled and ethanol precipitated overnight at -20° C.
in 0.3M sodium acetate. DNA pellets were resuspended in 50-100 µl UHP
water and quantitated by fluorimetry. The expression units were
diluted in Dulbecco's phosphate buffered saline without calcium and
magnesium (containing, per liter, 0.2 g KCl, 0.2 g KH2PO4, 8.0 g NaCl,
1.15 g Na2HPO4), mixed (using either the N1-Beta or N3-Beta expression
unit) in a 1:1:1 molar ratio, concentration adjusted to about 6 µg/ml,
and injected into the eggs (~2 pl total DNA solution per egg).
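
Mixing fragments in a 1:1:1 molar ratio means that each fragment's share of the total mass is proportional to its length. The sketch below shows the arithmetic for a total of 6 µg/ml. The three fragment lengths are placeholders chosen only for illustration, because the text does not give the sizes of the complete Mlu I expression units, and the function name is hypothetical.

```python
def equimolar_mass_concentrations(lengths_bp, total_ug_per_ml):
    """Split a total DNA mass concentration among fragments so that all
    fragments are present at the same molar concentration.

    For double-stranded DNA, mass per mole is proportional to length,
    so each fragment's mass share equals its share of the summed length.
    """
    total_bp = sum(lengths_bp.values())
    return {
        name: total_ug_per_ml * bp / total_bp
        for name, bp in lengths_bp.items()
    }


# Placeholder lengths in bp (assumptions, not values from the text).
lengths = {"A-alpha unit": 12000, "B-beta unit": 14500, "gamma unit": 14700}

concentrations = equimolar_mass_concentrations(lengths, total_ug_per_ml=6.0)
for name, conc in concentrations.items():
    print(f"{name}: {conc:.2f} ug/ml")
```

With real fragment lengths substituted, the same function gives the mass of each expression unit to combine before dilution.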

Recipient females of 6-8 weeks of age are prepared by mating B6CBAF1
females in natural estrus with vasectomized males. Females possessing
copulation plugs are then kept for transfer of microinjected eggs.

Following birth of potential transgenic animals, tail biopsies are
taken, under anesthesia, at four weeks of age. Tissue samples are
placed in 2 ml of tail buffer (0.3M Na acetate, 50 mM KCl, 1.5 mM
MgCl2, 10 mM Tris-HCl, pH 8.5, 0.5% NP40, 0.5% Tween 20) containing
200 µg/ml proteinase K (Boehringer Mannheim, Mannheim, Germany) and
vortexed. The samples are shaken (250 rpm) at 55°-60° for 3 hours to
overnight. DNA prepared from biopsy samples is examined for the
presence of the injected constructs by PCR and Southern blotting. The
digested tissue is vigorously vortexed, and 5 µl aliquots are placed
in 0.5 ml microcentrifuge tubes. Positive and negative tail samples
are included as controls. Forty µl of silicone oil (BDH, Poole, UK) is
added to each tube, and the tubes are briefly centrifuged. The tubes
are incubated in the heating block of a thermal cycler (e.g.
Omni-gene, Hybaid, Teddington, UK) to 95° C. for 10 minutes. Following
this, each tube has a 45 µl aliquot of PCR mix added such that the
final composition of each reaction mix is: 50 mM KCl; 2 mM MgCl2; 10
mM Tris-HCl (pH 8.3); 0.01% gelatin; 0.1% NP40; 10% DMSO; 500 nM each
primer; 200 µM dNTPs; 0.02 U/µl Taq polymerase (Boehringer Mannheim,
Mannheim, Germany). The tubes are then cycled through 30 repeated
temperature changes as required by the particular primers used. The
primers may be varied but in all cases must target the BLG promoter
region. This is specific for the injected DNA fragments because the
mouse does not have a BLG gene. Twelve µl of 5x loading buffer
containing Orange G marker dye (0.25% Orange G [Sigma], 15% Ficoll
type 400 [Pharmacia Biosystems Ltd., Milton Keynes, UK]) is then added
to each tube, and the reaction mixtures are electrophoresed on a 1.6%
agarose gel containing ethidium bromide (Sigma) until the marker dye
has migrated % of the length of the gel. The gel is visualized with a
UV light source emitting a wavelength of 254 nm. Transgenic mice
having one or more of the injected DNA fragments are identified by
this approach.
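
Because the 45 µl of PCR mix is diluted by the 5 µl sample, each component must be slightly more concentrated in the mix than in the final reaction. The sketch below works this out under two simplifying assumptions that are not stated in the text: the silicone oil overlay does not contribute to the aqueous volume, and the salts and detergent carried over in the 5 µl of tail lysate are ignored. The function name is hypothetical.

```python
SAMPLE_UL = 5.0   # tail digest aliquot
MIX_UL = 45.0     # PCR mix added to each tube

# Final composition of the reaction, as listed in the text.
FINAL = {
    "KCl (mM)": 50,
    "MgCl2 (mM)": 2,
    "Tris-HCl (mM)": 10,
    "gelatin (%)": 0.01,
    "NP40 (%)": 0.1,
    "DMSO (%)": 10,
    "each primer (nM)": 500,
    "dNTPs (uM)": 200,
    "Taq polymerase (U/ul)": 0.02,
}


def mix_concentration(final, sample_ul=SAMPLE_UL, mix_ul=MIX_UL):
    """Concentration needed in the mix to reach `final` after dilution."""
    return final * (sample_ul + mix_ul) / mix_ul


for component, final in FINAL.items():
    print(f"{component}: {mix_concentration(final):.4g} in the mix")
```

The correction factor is 50/45, so every component is about 11% more concentrated in the mix than in the final reaction.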

Positive tail samples are processed to obtain pure DNA. The DNA
samples are screened by Southern blotting using a BLG promoter probe
(nucleotides 2523-4253 of SEQ ID NO: 7). Specific cleavages with
appropriate restriction enzymes (e.g. Eco RI) allow the distinction of
the three constructs containing the Aα, Bβ and γ sequences.

Southern blot analysis of transgenic mice prepared essentially as
described above demonstrated that more than 50% of progeny contained
all three fibrinogen sequences. Examination of milk from positive
animals by reducing SDS polyacrylamide gel electrophoresis
demonstrated the presence of all three protein chains at
concentrations up to 1 mg/ml. The amount of fully assembled fibrinogen
was related to the ratios of individual subunits present in the milk.
No apparent phenotype was associated with high concentrations of human
fibrinogen in mouse milk.

Example IV 

Donor ewes are treated with an intravaginal progesterone-impregnated
sponge (CHRONOGEST Goat Sponge, Intervet, Cambridge, UK) on day 0.
Sponges are left in situ for ten or twelve days.

Superovulation is induced by treatment of donor ewes with a total of
one unit of ovine follicle stimulating hormone (OFSH) (OVAGEN, Horizon
Animal Reproduction Technology Pty. Ltd., New Zealand) administered in
eight intramuscular injections of 0.125 units per injection starting
at 5:00 pm on day -4 and ending at 8:00 am on day 0. Donors are
injected intramuscularly with 0.5 ml of a luteolytic agent (ESTRUMATE,
Vet-Drug) on day -4 to cause regression of the corpus luteum, to allow
return to estrus and ovulation. To synchronize ovulation, the donor
animals are injected intramuscularly with 2 ml of a synthetic
releasing hormone analog (RECEPTAL, Vet-Drug) at 5:00 pm on day 0.
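
The injection count and the start and end times are mutually consistent if injections are given twice a day, at 8:00 am and 5:00 pm. That twice-daily pattern is an inference from the stated figures rather than something the text spells out. A minimal sketch that enumerates the schedule under this assumption follows; the calendar date used for day 0 is an arbitrary placeholder.

```python
from datetime import datetime, timedelta

UNITS_PER_INJECTION = 0.125
INJECTION_HOURS = (8, 17)  # assumed twice-daily times: 8:00 am, 5:00 pm


def fsh_schedule(day0):
    """List (time, units) from 5:00 pm on day -4 to 8:00 am on day 0."""
    start = (day0 - timedelta(days=4)).replace(hour=17, minute=0)
    end = day0.replace(hour=8, minute=0)
    slots = []
    for offset in range(-4, 1):
        day = day0 + timedelta(days=offset)
        for hour in INJECTION_HOURS:
            when = day.replace(hour=hour, minute=0)
            if start <= when <= end:
                slots.append((when, UNITS_PER_INJECTION))
    return slots


day0 = datetime(2000, 1, 10)  # placeholder date standing in for day 0
schedule = fsh_schedule(day0)
for when, units in schedule:
    offset = (when.date() - day0.date()).days
    print(f"day {offset:+d} {when:%H:%M}  {units} units")
print("injections:", len(schedule))
print("total units:", sum(units for _, units in schedule))
```

Under this assumption the schedule contains eight injections totalling one unit, matching the figures in the text.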

Donors are starved of food and water for at least 12 hours before
artificial insemination (A.I.). The animals are artificially
inseminated by intrauterine laparoscopy under sedation and local
anesthesia on day 1. Either xylazine (ROMPUN, Vet-Drug) at a dose rate
of 0.05-0.1 ml per 10 kg bodyweight or ACP injection 10 mg/ml
(Vet-Drug) at a dose rate of 0.1 ml per 10 kg bodyweight is injected
intramuscularly approximately fifteen minutes before A.I. to provide
sedation. A.I. is carried out using freshly collected semen from a
Poll Dorset ram. Semen is diluted with equal parts of filtered
phosphate buffered saline, and 0.2 ml of the diluted semen is injected
per uterine horn. Immediately pre- or post-A.I., donors are given an
intramuscular injection of AMOXYPEN (Vet-Drug).

Fertilized eggs are recovered on day 2 following starvation of donors
of food and water from 5:00 pm on day 1. Recovery is carried out under
general anesthesia induced by an intravenous injection of 5%
thiopentone sodium (INTRAVAL SODIUM, Vet-Drug) at a dose rate of 3 ml
per 10 kg bodyweight. Anesthesia is maintained by inhalation of 1-2%
Halothane/O2/N2O after intubation. To recover the fertilized eggs, a
laparotomy incision is made, and the uterus is exteriorized. The eggs
are recovered by retrograde flushing of the oviducts with Ovum Culture
Medium (Advanced Protein Products, Brierly Hill, West Midlands, UK)
supplemented with bovine serum albumin of New Zealand origin. After
flushing, the uterus is returned to the abdomen, and the incision is
closed. Donors are allowed to recover post-operatively or are
euthanized. Donors that are allowed to recover are given an
intramuscular injection of Amoxypen L.A. at the manufacturer's
recommended dose rate immediately pre- or post-operatively.

Plasmids containing the three fibrinogen chain expression units are
digested with Mlu I, and the expression unit fragments are recovered
and purified on sucrose density gradients. The fragment concentrations
are determined by fluorimetry and diluted in Dulbecco's phosphate
buffered saline without calcium and magnesium as described above. The
concentration is adjusted to 6 µg/ml and approximately 2 pl of the
mixture is microinjected into one pronucleus of each fertilized egg
with visible pronuclei.

All fertilized eggs surviving pronuclear microinjection are cultured
in vitro at 38.5° C. in an atmosphere of 5% CO2: 5% O2: 90% N2 and
about 100% humidity in a bicarbonate buffered synthetic oviduct medium
(see Table) supplemented with 20% v/v vasectomized ram serum. The
serum may be heat inactivated at 56° C. for 30 minutes and stored
frozen at -20° C. prior to use. The fertilized eggs are cultured for a
suitable period of time to allow early embryo mortality (caused by the
manipulation techniques) to occur. These dead or arrested embryos are
discarded. Embryos having developed to 5 or 6 cell divisions are
transferred to synchronized recipient ewes.

TABLE

Synthetic Oviduct Medium

Stock A (Lasts 3 months)
    NaCl                          6.29 g
    KCl                           0.534 g
    KH2PO4                        0.162 g
    MgSO4.7H2O                    0.182 g
    Penicillin                    0.06 g
    Sodium Lactate 60% syrup      0.6 mls
    Super H2O                     99.4 mls

Stock B (Lasts 2 weeks)
    NaHCO3                        0.21 g
    Phenol red                    0.001 g
    Super H2O                     10 mls

Stock C (Lasts 2 weeks)
    Sodium Pyruvate               0.051 g
    Super H2O                     10 mls

Stock D (Lasts 3 months)
    CaCl2.2H2O                    0.262 g
    Super H2O                     10 mls

Stock E (Lasts 3 months)
    Hepes                         0.651 g
    Phenol red                    0.001 g
    Super H2O                     10 mls

To make up 10 mls of Bicarbonate Buffered Medium
    STOCK A                       1 ml
    STOCK B                       1 ml
    STOCK C                       0.07 ml
    STOCK D                       0.1 ml
    Super H2O                     7.83 ml
Osmolality should be 265-285 mOsm.
Add 2.5 ml of heat inactivated sheep serum and filter sterilize.

To make up 10 mls HEPES Buffered Medium
    STOCK A                       1 ml
    STOCK B                       0.2 ml
    STOCK C                       0.07 ml
    STOCK D                       0.1 ml
    STOCK E                       0.8 ml
    Super H2O                     7.83 ml
Osmolality should be 255-285 mOsm.
Add 1.5 ml of heat inactivated sheep serum and filter sterilize.
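
The make-up volumes for the bicarbonate buffered medium can be checked with a few lines of arithmetic. The sketch below sums the component volumes, estimates the final bicarbonate concentration from the Stock B recipe, and computes the serum fraction. It reads the serum addition as 2.5 ml and uses a molar mass of 84.01 g/mol for NaHCO3; both are assumptions made for this illustration rather than values stated in the text.

```python
# Volumes (ml) for 10 ml of bicarbonate buffered medium, from the Table.
BICARBONATE_MEDIUM_ML = {
    "Stock A": 1.0,
    "Stock B": 1.0,
    "Stock C": 0.07,
    "Stock D": 0.1,
    "Super H2O": 7.83,
}

NAHCO3_G_PER_10_ML = 0.21      # Stock B recipe
NAHCO3_G_PER_MOL = 84.01       # assumed molar mass of NaHCO3
SERUM_ML = 2.5                 # serum addition, read as 2.5 ml

total_ml = sum(BICARBONATE_MEDIUM_ML.values())

# Stock B concentration in mM, then diluted into the finished medium.
stock_b_mM = NAHCO3_G_PER_10_ML / 10.0 * 1000.0 / NAHCO3_G_PER_MOL * 1000.0
bicarbonate_mM = stock_b_mM * BICARBONATE_MEDIUM_ML["Stock B"] / total_ml

serum_fraction = SERUM_ML / (total_ml + SERUM_ML)

print(f"total volume before serum: {total_ml:.2f} ml")
print(f"NaHCO3 in medium before serum: {bicarbonate_mM:.1f} mM")
print(f"serum fraction after addition: {serum_fraction:.0%}")
```

On this reading the serum makes up one fifth of the supplemented medium, which agrees with the 20% v/v serum stated for the culture medium.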

Recipient ewes are treated with an intravaginal
progesterone-impregnated sponge (Chronogest Ewe Sponge or Chronogest
Ewe-Lamb Sponge, Intervet) left in situ for 10 or 12 days. The ewes
are injected intramuscularly with 1.5 ml (300 iu) of a follicle
stimulating hormone substitute (P.M.S.G., Intervet) and with 0.5 ml of
a luteolytic agent (Estrumate, Coopers Pitman-Moore) at sponge removal
on day -1. The ewes are tested for estrus with a vasectomized ram
between 8:00 am and 5:00 pm on days 0 and 1.

Embryos surviving in vitro culture are returned to recipients (starved
from 5:00 pm on day 5 or 6) on day 6 or 7. Embryo transfer is carried
out under general anesthesia as described above. The uterus is
exteriorized via a laparotomy incision with or without laparoscopy.
Embryos are returned to one or both uterine horns only in ewes with at
least one suitable corpus luteum. After replacement of the uterus, the
abdomen is closed, and the recipients are allowed to recover. The
animals are given an intramuscular injection of Amoxypen L.A. at the
manufacturer's recommended dose rate immediately pre- or
post-operatively.

Lambs are identified by ear tags and left with their dams for rearing.
Ewes and lambs are either housed and fed complete diet concentrates
and other supplements and/or ad lib. hay, or are let out to grass.

Within the first week of life (or as soon thereafter as possible
without prejudicing health), each lamb is tested for the presence of
the heterologous DNA by two sampling procedures. A 10 ml blood sample
is taken from the jugular vein into an EDTA vacutainer. If fit enough,
the lambs also have a second 10 ml blood sample taken within one week
of the first. Tissue samples are taken by tail biopsy as soon as
possible after the tail has become desensitized after the application
of a rubber elastrator ring to its proximal third (usually within 200
minutes after "tailing"). The tissue is placed immediately in a
solution of tail buffer. Tail samples are kept at room temperature and
analyzed on the day of collection. All lambs are given an
intramuscular injection of Amoxypen L.A. at the manufacturer's
recommended dose rate immediately post-biopsy, and the cut end of the
tail is sprayed with an antibiotic spray.

DNA is extracted from sheep blood by first separating white blood
cells. A 10 ml sample of blood is diluted in 20 ml of Hank's buffered
saline (HBS; obtained from Sigma Chemical Co.). Ten ml of the diluted
blood is layered over 5 ml of Histopaque (Sigma) in each of two 15 ml
screw-capped tubes. The tubes are centrifuged at 3000 rpm (2000 x g
max.), low brake for 15 minutes at room temperature. White cell
interfaces are removed to a clean 15 ml tube and diluted to 15 ml in
HBS. The diluted cells are spun at 3000 rpm for 10 minutes at room
temperature, and the cell pellet is recovered and resuspended in 2-5
ml of tail buffer.

To extract DNA from the white cells, 10% SDS is added to the
resuspended cells to a final concentration of 1%, and the tube is
inverted to mix the solution. One mg of fresh proteinase K solution is
added, and the mixture is incubated overnight at 45° C. DNA is
extracted using an equal volume of phenol/chloroform (x3) and
chloroform/isoamyl alcohol (x1). The DNA is then precipitated by
adding 0.1 volume of 3M NaOAc and 2 volumes of ethanol, and the tube
is inverted to mix. The precipitated DNA is spooled out using a clean
glass rod with a sealed end. The spool is washed in 70% ethanol, and
the DNA is allowed to partially dry, then is redissolved in TE (10 mM
Tris-HCl, 1 mM EDTA, pH 7.4).

DNA samples from blood and tail are analyzed by Southern blotting
using probes for the BLG promoter region and the fibrinogen chain
coding regions.

From the foregoing, it will be appreciated that although specific
embodiments of the invention have been described herein for purposes
of illustration, various modifications may be made without deviating
from the spirit and scope of the invention. Accordingly, the invention
is not limited except as by the appended claims.



SEQUENCE LISTING 



( 1 ) GENERAL INFORMATION:

( i i i ) NUMBER OF SEQUENCES: 27

( 2 ) INFORMATION FOR SEQ ID NO:1:

( i ) SEQUENCE CHARACTERISTICS:
( A ) LENGTH: 5943 base pairs
( B ) TYPE: nucleic acid
( C ) STRANDEDNESS: double
( D ) TOPOLOGY: linear

( i i ) MOLECULE TYPE: DNA (genomic)

( v i i ) IMMEDIATE SOURCE:
( B ) CLONE: Human Fibrinogen A-alpha chain

( i x ) FEATURE:
( A ) NAME/KEY: CDS
( B ) LOCATION: join(31-84, 1154-1279, 1739-1922, 3055-3200,
3786-5210)

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:1:
GTCTAOOAGC CAOCCCCACC CTTAOAAAAG ATO TTT TCC ATO AGO ATC OTC TOC 

Met Pbc Set Met Arg lie Val Cyi 

1 5 

CTA OTT CTA ACT GTG OTG OOC ACA OCA TOG GTATGGCCCT TTTCATTTTT 
Leu Val Leo Ser V.l V»l Oly Tbr All Trp 
10 15 

TCTTCTTGCT TTCTCTCTOO TGTTT ATTCC ACAAAGAGCC TGGAOOTCAO AGTCTACCTG 

CTCTATOTCC TOACACACTC TTAGCTTTAT OACCCCAOGC CTGOOAOGAA ATTTCCTOGO
TOOOCTTOAC ACCTCAAOAA TACAOOOTAA TATOACACCA AOAOOAAOAT CTTAOATOOA 2*4

TGAGAGTOTA C A ACT AC A AG GOAAACTTTA GCATCTGTCA TTC AOTCTTA CCAC ATTTTO 344 

TTTTGTTTTG TTTT AAAAAG OOCAAOAATT ATTTOCCATC CTTOT ACCTA TAAAOCCTTO 404 

GTGC ATT AT A ATGCT AGTT A ATGGAATAAA ACATTTT ATO OTAAGATTTG TTTTCTTTAO 464 

TTATTAATTT CTTGCTACTT CTCCATAATA AGCAG AACTT TTAOTGTTAG TACAOTTTTG 324 

CT OAA AGGTT ATTGTTGTGT TTGTCAAGAC AG A AG AA AAA OCAAACG AAT TATCTTTOOA 5*4 

AATATCTTTG CAOTATCAOA AG AO ATT ACT TAGTAAGOCA ATACOCTTTT CCGCAGTAAT 644 

GGTATTCTTT T AA ATT ATGA ATCCATCTCT AA AGGTT AC A TAOAAACTTO A AGO AG A GAG 704 

GAACATTCAG TTAAOATAGT CTAGOTTTTT CTACTGAAGC AOCAATTACA OOAOAAAOAO 764 

CTCTACAGTA GTTTTCAACT TTCTGTCTOC AOTCATTAGT AA AAA TO AAA AOGTAAAATT *24 

TAACTGATTT TATAOATTCA AATAATTTTC CTT TT AGO AT GOATTCTTTA AAACTCCTAA **4 

TATTTATCAA ATGC TT ATTT AAGTOTCACA C AC AGTT A AG A AATTTGT AC ACCTTOTCTC 944 

CTTTAATTCT C AT A ACA ACT CCAT AAAATG OGTCCTAOGA TTTCCATTTO AAOATAAGAA 1004 

ACCTGAAOCT TGCCGAAGCC CTGTGTCTOC TCTCCTTAAT CTCTOTOAGA GTGCCAT CTC 10*4 

TT CCTGGGGA CTTGT AGGC A TGCCACTOTC TCCTCTTCTG GCTAACATTO CTGTTOCTCT 1124 

CTTTTOTOTA TOTOAATOAA TCTTTAAAO ACT OCA OAT AOT GOT OAA GOT GAC 1177 

Tbr Ala Aip Ser Oly Gl« Ol y Asp 

20 25 

TTT CTA OCT G A A GO A OOA GOC OTO COT GOC CCA AGO GTT OTO OAA AO A 1225 

Pbc Lew Ala Gin Gly Gly Oly Val Arg Oly P*o Arg Vil V* 1 OU Arg 

3 0 3 5 40 

CAT CAA TCT GCC TOC AAA OAT TCA GAC TOO CCC TTC TGC TCT OAT OAA 1273 

Hi* 0 1* Ser Ala Cy» Ly» Asp S.r Asp Trp Pro Phe Cy i S«r Aip 01» 

45 30 55 

GAC TOG OTAAOCAGTC AOCOOGOGAA OCAOGAGATT CCTTCCCTCT O ATOCT AOAG 1329 
A * p Trp 
6 0 

OGOCTCACAG OCTGACCTOA TTOOTCCCAO AAACTTTTTT A A AT AO A A A A TAATTGAATA 13*9 

OTTACCTACA TAOCAAATAA AOAAAAOOAA CCTACTCCCA AOAGCACTGT TTATTTACCT 1449 

CCCCAACTCT OGATCATTAG TOOGTGAACA OACAOGATTT CAOTTGCATO CTCAOOCAAA 1509 

ACCAOOCTCC TOAOTATTOT OOCCTCAATT TCCTOOCACC TATTTATOOC TAAOTOGACC 1569 

CTCATTCCAO AOTTTCTCTO COACCTCTAA CTAOTCCTCT TACCTACTTT TAAOCCAACT 1629 

TATCTOGAAO AGAAAOGOTA OOAAGAAATO OOOOCTOCAT OOAAACATOC AAAATTATTC 1619 

TOAATCTOAO AOATAGATCC TTACTOT AAT TTTCTCCCTT CACTTTCAO AAC TAC 1744 

Asa T y r 

AAA TOC CCT TCT GOC TOC AGO ATO AAA OOG TTG ATT OAT OAA OTC AAT 1792 

Lys Cys Pro Ser Oly Cys Arg Met Lyi Oly Lett 11* Asp OU Val A • a 

6 3 7 0 7 5 

CAA OAT TTT ACA AAC AO A ATA AAT A AO CTC AAA AAT TCA CTA TTT OAA 1*40 

Ola A»p Pbe Tbr Asa At| I 1 « Asa Lyi Lett Lyi A»a Scr Lett Pfa« Ola 

8 0 * 3 9 0 

TAT CAO A AG AAC AAT A AO GAT TCT CAT TCO TTO ACC ACT AAT ATA ATO 1SS* 

Tyr Ola Lyi Asa Asa Lys Asp S«r His S«i Leo Tbr Tbr Asa II* Mat 

95 100 105 110 

OAA ATT TTO AG A GOC OAT TTT TCC TCA OCC AAT A OTAAOTATTA 1932 
OU lie Leu Arg Oly Asp Pb« Ser Ser Ale Asa 
115 120 

CATATTTACT TCTTTGACTT TATAACAGAA ACAACAAAAA TCCTAAATAA ATATGATATC  1992
TATGACAATT TCATCCCAAA GTACTTAGTG TAGAAACACA TACCTTCATA  2052
AAATTTTAAG AGGGAGCTTT TGTTTTCGTT ATTTTTTCAA AGTAAAAGAT  2112
ATTGTTTAAG GTCACAAAAT AAGTCAGAAT TTTGGATTAA AACAAGAATT  2172
CTTTTCAACA GTATATACTG AAAGTAGGAT GGGTCAGACT CTTTGAGTTG  2232
TTCTGCTTTG TAAAGGTGAA AACTGAGAGG TCAAGGAACT TGTTCAAAGA  2292
GGAATTCAAC TCCCAGACTC CACTGAGCTG ATTAGGTAGA TTTTTAAATT  2352
GGTCAAGCTA CGTCATTCTC ACAGTCTACT CATTAGGGTT AGGAAACATT  2412
TGGGCATGGA CAGCGAGTCT AGGGAGTCCT CAGTTTCTCA AGTTTTGCTT  2472
CACCTTCACA AACACTTGAC ATTTAAAATC AGTGATGCCA ACACTAGCTG  2532
GATCCTGTTG ACCCAAAACA GCTTAGGAAC CATTTCAAAT CTATAGAGTT  2592
GCTCATCAGT AAGAAAATCC AATATGTTCA AGTCCCTTGA TTAAGGATGT  2652
TTGAAATflCA ATCAAACCAA CTATTTTAAC TCCAAATTAC ACCTTTAAAA  2712
AGTTCTTCTT CTATATTTCT TTGGGATTAC TAATTGCTAT TAGGACATCT  2772
TCATGGAAGG CTGCAGGGCA TAACATTATC CAAAAGTCAA ATGCCCCATA  2832
TCACAGATTA AACTGTAACC AAAATAAAAT TAGGCATATT TACAAGCTAG  2892
TCTTTTTTCT CTTTCTTTCT TTCTTTCTTT CTTTCTTTCT TTCTTTCTTT  2952
TTCTCCTTCC TTCCTTTCTT CCTTTCTTTT TTGCTGGCAA TTACAGACAA  3012

AGCTACTTCA ATAACCATAT TTTCGATTTC AG  AC CGT GAT AAT  3065
                                    Asn Arg Asp Asn
                                                125

ACC TAC AAC CGA GTG TCA GAG GAT CTG AGA AGC AGA ATT GAA GTC CTG  3113
Thr Tyr Asn Arg Val Ser Glu Asp Leu Arg Ser Arg Ile Glu Val Leu
                130                 135                 140

AAG CGC AAA GTC ATA GAA AAA GTA CAG CAT ATC CAG CTT CTG CAG AAA  3161
Lys Arg Lys Val Ile Glu Lys Val Gln His Ile Gln Leu Leu Gln Lys
            145                 150                 155

AAT GTT AGA GCT CAG TTG GTT GAT ATG AAA CGA CTG GAG GTAAGTATGT  3210
Asn Val Arg Ala Gln Leu Val Asp Met Lys Arg Leu Glu
        160                 165                 170

GGCTGTGGTC CCGAGTGTCC TTGTTTTTGA GTAGAGGGAA AAGGAAGGCG ATAGTTATGC  3270
ACTGAGTGTC TACTATATGC AGAGAAAAGT GTTATATCCA TCATCTACCT AAAAGTAGGT  3330
ATTATTTTCC TCACTCCACA GTTGAAGAAA AAAAAATTCA GAGATATTAA GTAAATTTTC  3390
CAACGTACAT AGATAGTAAT TCAAAGCAAT GTTCAGTCCC TGTCTATTCC AAGCCATTAC  3450
ATCACCACAC CTCTGAGCCC TCAGCCTGAG TTCACCAAGG ATCATTTAAT TAGCGTTTCC  3510
TTTGAGAGGG AATAGCACCT TACTCTTGAT CCATTCTGAG GCTAAGATGA ATTAAACAGC  3570
ATCCATTGCT TATCCTGGCT AGCCCTGCAA TACCCAACAT CTCTTCCACT GAGGGTGCTC  3630
GATAGGCAGA AAACAGAGAA TATTAAGTGG TAGGTCTCCG AGTCAAAAAA AATGAAACCA  3690
GTTTCCAGAA GGAAAATTAA CTACCAGGAA CTCAATAGAC GTAGTTTATG TATTTGTATC  3750

TACATTTTCT CTTTATTTTT CTCCCCTCTC TCTAG GTG GAC ATT GAT ATT AAG  3803
                                       Val Asp Ile Asp Ile Lys
                                                       175

ATC CGA TCT TGT CGA GGG TCA TGC AGT AGG GCT TTA GCT CGT GAA GTA  3851
Ile Arg Ser Cys Arg Gly Ser Cys Ser Arg Ala Leu Ala Arg Glu Val
            180                 185                 190

GAT CTG AAG GAC TAT GAA GAT CAG CAG AAG CAA CTT GAA CAG GTC ATT  3899
Asp Leu Lys Asp Tyr Glu Asp Gln Gln Lys Gln Leu Glu Gln Val Ile
        195                 200                 205

GCC AAA GAC TTA CTT CCC TCT AGA GAT AGG CAA CAC TTA CCA CTG ATA  3947
Ala Lys Asp Leu Leu Pro Ser Arg Asp Arg Gln His Leu Pro Leu Ile
    210                 215                 220

AAA ATG AAA CCA GTT CCA GAC TTG GTT CCC GGA AAT TTT AAG AGC CAG  3995
Lys Met Lys Pro Val Pro Asp Leu Val Pro Gly Asn Phe Lys Ser Gln
225                 230                 235                 240

CTT CAG AAG GTA CCC CCA GAG TGG AAG GCA TTA ACA GAC ATG CCG CAG  4043
Leu Gln Lys Val Pro Pro Glu Trp Lys Ala Leu Thr Asp Met Pro Gln
                245                 250                 255

ATG AGA ATG GAG TTA GAG AGA CCT GGT GGA AAT GAG ATT ACT CGA GGA  4091
Met Arg Met Glu Leu Glu Arg Pro Gly Gly Asn Glu Ile Thr Arg Gly
            260                 265                 270

GGC TCC ACC TCT TAT GGA ACC GGA TCA GAG ACG GAA AGC CCC AGG AAC  4139
Gly Ser Thr Ser Tyr Gly Thr Gly Ser Glu Thr Glu Ser Pro Arg Asn
        275                 280                 285

CCT AGC AGT GCT GGA AGC TGG AAC TCT GGG AGC TCT GGA CCT GGA AGT  4187
Pro Ser Ser Ala Gly Ser Trp Asn Ser Gly Ser Ser Gly Pro Gly Ser
    290                 295                 300

ACT GGA AAC CGA AAC CCT GGG AGC TCT GGG ACT GGA GGG ACT GCA ACC  4235
Thr Gly Asn Arg Asn Pro Gly Ser Ser Gly Thr Gly Gly Thr Ala Thr
305                 310                 315                 320

TGG AAA CCT GGG AGC TCT GGA CCT GGA AGT GCT GGA AGC TGG AAC TCT  4283
Trp Lys Pro Gly Ser Ser Gly Pro Gly Ser Ala Gly Ser Trp Asn Ser
                325                 330                 335

GGG AGC TCT GGA ACT GGA AGT ACT GGA AAC CAA AAC CCT GGG AGC CCT  4331
Gly Ser Ser Gly Thr Gly Ser Thr Gly Asn Gln Asn Pro Gly Ser Pro
            340                 345                 350

AGA CCT GGT AGT ACC GGA ACC TGG AAT CCT GGC AGC TCT GAA CGC GGA  4379
Arg Pro Gly Ser Thr Gly Thr Trp Asn Pro Gly Ser Ser Glu Arg Gly
        355                 360                 365

AGT GCT GGG CAC TGG ACC TCT GAG AGC TCT GTA TCT GGT AGT ACT GGA  4427
Ser Ala Gly His Trp Thr Ser Glu Ser Ser Val Ser Gly Ser Thr Gly
    370                 375                 380

CAA TGG CAC TCT GAA TCT GGA AGT TTT AGG CCA GAT AGC CCA GGC TCT  4475
Gln Trp His Ser Glu Ser Gly Ser Phe Arg Pro Asp Ser Pro Gly Ser
385                 390                 395                 400

GGG AAC GCG AGG CCT AAC AAC CCA GAC TGG GGC ACA TTT GAA GAG GTG  4523
Gly Asn Ala Arg Pro Asn Asn Pro Asp Trp Gly Thr Phe Glu Glu Val
                405                 410                 415

TCA GGA AAT GTA AGT CCA GGG ACA AGG AGA GAG TAC CAC ACA GAA AAA  4571
Ser Gly Asn Val Ser Pro Gly Thr Arg Arg Glu Tyr His Thr Glu Lys
            420                 425                 430

CTG GTC ACT TCT AAA GGA GAT AAA GAG CTC AGG ACT GGT AAA GAG AAG  4619
Leu Val Thr Ser Lys Gly Asp Lys Glu Leu Arg Thr Gly Lys Glu Lys
        435                 440                 445

GTC ACC TCT GGT AGC ACA ACC ACC ACG CGT CGT TCA TGC TCT AAA ACC  4667
Val Thr Ser Gly Ser Thr Thr Thr Thr Arg Arg Ser Cys Ser Lys Thr
    450                 455                 460

GTT ACT AAG ACT GTT ATT GGT CCT GAT GGT CAC AAA GAA GTT ACC AAA  4715
Val Thr Lys Thr Val Ile Gly Pro Asp Gly His Lys Glu Val Thr Lys
465                 470                 475                 480

GAA GTG GTG ACC TCC GAA GAT GGT TCT GAC TGT CCC GAG GCA ATG GAT  4763
Glu Val Val Thr Ser Glu Asp Gly Ser Asp Cys Pro Glu Ala Met Asp
                485                 490                 495

TTA GGC ACA TTG TCT GGC ATA GGT ACT CTG GAT GGG TTC CGC CAT AGG  4811
Leu Gly Thr Leu Ser Gly Ile Gly Thr Leu Asp Gly Phe Arg His Arg
            500                 505                 510

CAC CCT GAT GAA GCT GCC TTC TTC GAC ACT GCC TCA ACT GGA AAA ACA  4859
His Pro Asp Glu Ala Ala Phe Phe Asp Thr Ala Ser Thr Gly Lys Thr
        515                 520                 525

TTC CCA GGT TTC TTC TCA CCT ATG TTA GGA GAG TTT GTC AGT GAG ACT  4907
Phe Pro Gly Phe Phe Ser Pro Met Leu Gly Glu Phe Val Ser Glu Thr
    530                 535                 540

GAG TCT AGG GGC TCA GAA TCT GGC ATC TTC ACA AAT ACA AAG GAA TCC  4955
Glu Ser Arg Gly Ser Glu Ser Gly Ile Phe Thr Asn Thr Lys Glu Ser
545                 550                 555                 560

AGT TCT CAT CAC CCT GGG ATA GCT GAA TTC CCT TCC CGT GGT AAA TCT  5003
Ser Ser His His Pro Gly Ile Ala Glu Phe Pro Ser Arg Gly Lys Ser
                565                 570                 575

TCA AGT TAC AGC AAA CAA TTT ACT AGT AGC ACG AGT TAC AAC AGA GGA  5051
Ser Ser Tyr Ser Lys Gln Phe Thr Ser Ser Thr Ser Tyr Asn Arg Gly
            580                 585                 590

GAC TCC ACA TTT GAA AGC AAG AGC TAT AAA ATG GCA GAT GAG GCC GGA  5099
Asp Ser Thr Phe Glu Ser Lys Ser Tyr Lys Met Ala Asp Glu Ala Gly
        595                 600                 605

AGT GAA GCC GAT CAT GAA GGA ACA CAT AGC ACC AAG AGA GGC CAT GCT  5147
Ser Glu Ala Asp His Glu Gly Thr His Ser Thr Lys Arg Gly His Ala
    610                 615                 620

AAA TCT CGC CCT GTC AGA GGT ATC CAC ACT TCT CCT TTG GGG AAG CCT  5195
Lys Ser Arg Pro Val Arg Gly Ile His Thr Ser Pro Leu Gly Lys Pro
625                 630                 635                 640

TCC CTG TCC CCC TAGACTAAGT TAAATATTTC TGCACAGTGT TCCCATGGCC  5247
Ser Leu Ser Pro

CCTTGCATTT CCTTCTTAAC TCTCTGTTAC ACGTCATTGA AACTACACTT TTTTGGTCTG  5307
TTTTTGTGCT AGACTGTAAG TTCCTTGGGG GCAGGGCCTT TGTCTGTCTC ATCTCTGTAT  5367
TCCCAAATGC CTAACAGTAC AGAGCCATGA CTCAATAAAT ACATGTTAAA TGGATGAATG  5427
AATTCCTCTG AAACTCTATT TGAGCTTATT TAGTCAAATT CTTTCACTAT TCAAAGTGTG  5487
TGCTATTAGA ATTGTCACCC AACTGATTAA TCACATTTTT AGTATGTGTC TCAGTTGACA  5547
TTTAGGTCAG GCTAAATACA AGTTGTGTTA GTATTAAGTG AGCTTAGCTA CCTGTACTGG  5607
TTACTTGCTA TTAGTTTGTG CAAGTAAAAT TCCAAATACA TTTGAGGAAA ATCCCCTTTG  5667
CAATTTGTAG GTATAAATAA CCGCTTATTT GCATAAGTTC TATCCCACTG TAAGTGCATC  5727
CTTTCCCTAT GGAGGGAAGG AAAGGAGGAA GAAAGAAAGG AAGGGAAAGA AACAGTATTT  5787
GCCTTATTTA ATCTGAGCCG TGCCTATCTT TGTAAAGTTA AATGAGAATA ACTTCTTCCA  5847
ACCAGCTTAA TTTTTTTTTT AGACTGTGAT GATGTCCTCC AAACACATCC TTCAGGTACC  5907
CAAAGTGGCA TTTTCAATAT CAAGCTATCC GGATCC  5943



(2) INFORMATION FOR SEQ ID NO:2:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 644 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:

Met Phe Ser Met Arg Ile Val Cys Leu Val Leu Ser Val Val Gly Thr
1               5                   10                  15

Ala Trp Thr Ala Asp Ser Gly Glu Gly Asp Phe Leu Ala Glu Gly Gly
            20                  25                  30

Gly Val Arg Gly Pro Arg Val Val Glu Arg His Gln Ser Ala Cys Lys
        35                  40                  45

Asp Ser Asp Trp Pro Phe Cys Ser Asp Glu Asp Trp Asn Tyr Lys Cys
    50                  55                  60

Pro Ser Gly Cys Arg Met Lys Gly Leu Ile Asp Glu Val Asn Gln Asp
65                  70                  75                  80

Phe Thr Asn Arg Ile Asn Lys Leu Lys Asn Ser Leu Phe Glu Tyr Gln
                85                  90                  95

Lys Asn Asn Lys Asp Ser His Ser Leu Thr Thr Asn Ile Met Glu Ile
            100                 105                 110

Leu Arg Gly Asp Phe Ser Ser Ala Asn Asn Arg Asp Asn Thr Tyr Asn
        115                 120                 125

Arg Val Ser Glu Asp Leu Arg Ser Arg Ile Glu Val Leu Lys Arg Lys
    130                 135                 140

Val Ile Glu Lys Val Gln His Ile Gln Leu Leu Gln Lys Asn Val Arg
145                 150                 155                 160

Ala Gln Leu Val Asp Met Lys Arg Leu Glu Val Asp Ile Asp Ile Lys
                165                 170                 175

Ile Arg Ser Cys Arg Gly Ser Cys Ser Arg Ala Leu Ala Arg Glu Val
            180                 185                 190

Asp Leu Lys Asp Tyr Glu Asp Gln Gln Lys Gln Leu Glu Gln Val Ile
        195                 200                 205

Ala Lys Asp Leu Leu Pro Ser Arg Asp Arg Gln His Leu Pro Leu Ile
    210                 215                 220

Lys Met Lys Pro Val Pro Asp Leu Val Pro Gly Asn Phe Lys Ser Gln
225                 230                 235                 240

Leu Gln Lys Val Pro Pro Glu Trp Lys Ala Leu Thr Asp Met Pro Gln
                245                 250                 255

Met Arg Met Glu Leu Glu Arg Pro Gly Gly Asn Glu Ile Thr Arg Gly
            260                 265                 270

Gly Ser Thr Ser Tyr Gly Thr Gly Ser Glu Thr Glu Ser Pro Arg Asn
        275                 280                 285

Pro Ser Ser Ala Gly Ser Trp Asn Ser Gly Ser Ser Gly Pro Gly Ser
    290                 295                 300

Thr Gly Asn Arg Asn Pro Gly Ser Ser Gly Thr Gly Gly Thr Ala Thr
305                 310                 315                 320

Trp Lys Pro Gly Ser Ser Gly Pro Gly Ser Ala Gly Ser Trp Asn Ser
                325                 330                 335

Gly Ser Ser Gly Thr Gly Ser Thr Gly Asn Gln Asn Pro Gly Ser Pro
            340                 345                 350

Arg Pro Gly Ser Thr Gly Thr Trp Asn Pro Gly Ser Ser Glu Arg Gly
        355                 360                 365

Ser Ala Gly His Trp Thr Ser Glu Ser Ser Val Ser Gly Ser Thr Gly
    370                 375                 380

Gln Trp His Ser Glu Ser Gly Ser Phe Arg Pro Asp Ser Pro Gly Ser
385                 390                 395                 400

Gly Asn Ala Arg Pro Asn Asn Pro Asp Trp Gly Thr Phe Glu Glu Val
                405                 410                 415

Ser Gly Asn Val Ser Pro Gly Thr Arg Arg Glu Tyr His Thr Glu Lys
            420                 425                 430

Leu Val Thr Ser Lys Gly Asp Lys Glu Leu Arg Thr Gly Lys Glu Lys
        435                 440                 445

Val Thr Ser Gly Ser Thr Thr Thr Thr Arg Arg Ser Cys Ser Lys Thr
    450                 455                 460

Val Thr Lys Thr Val Ile Gly Pro Asp Gly His Lys Glu Val Thr Lys
465                 470                 475                 480

Glu Val Val Thr Ser Glu Asp Gly Ser Asp Cys Pro Glu Ala Met Asp
                485                 490                 495

Leu Gly Thr Leu Ser Gly Ile Gly Thr Leu Asp Gly Phe Arg His Arg
            500                 505                 510

His Pro Asp Glu Ala Ala Phe Phe Asp Thr Ala Ser Thr Gly Lys Thr
        515                 520                 525

Phe Pro Gly Phe Phe Ser Pro Met Leu Gly Glu Phe Val Ser Glu Thr
    530                 535                 540

Glu Ser Arg Gly Ser Glu Ser Gly Ile Phe Thr Asn Thr Lys Glu Ser
545                 550                 555                 560

Ser Ser His His Pro Gly Ile Ala Glu Phe Pro Ser Arg Gly Lys Ser
                565                 570                 575

Ser Ser Tyr Ser Lys Gln Phe Thr Ser Ser Thr Ser Tyr Asn Arg Gly
            580                 585                 590

Asp Ser Thr Phe Glu Ser Lys Ser Tyr Lys Met Ala Asp Glu Ala Gly
        595                 600                 605

Ser Glu Ala Asp His Glu Gly Thr His Ser Thr Lys Arg Gly His Ala
    610                 615                 620

Lys Ser Arg Pro Val Arg Gly Ile His Thr Ser Pro Leu Gly Lys Pro
625                 630                 635                 640

Ser Leu Ser Pro

(2) INFORMATION FOR SEQ ID NO:3:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 8878 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(vii) IMMEDIATE SOURCE:
(B) CLONE: box

(ix) FEATURE:
(A) NAME/KEY: misc_RNA
(B) LOCATION: 1..469

(ix) FEATURE:
(A) NAME/KEY: exon
(B) LOCATION: 470..583

(ix) FEATURE:
(A) NAME/KEY: intron
(B) LOCATION: 584..3257

(ix) FEATURE:
(A) NAME/KEY: exon
(B) LOCATION: 3258..3449

(ix) FEATURE:
(A) NAME/KEY: intron
(B) LOCATION: 3450..3938

(ix) FEATURE:
(A) NAME/KEY: exon
(B) LOCATION: 3939..4122

(ix) FEATURE:
(A) NAME/KEY: intron
(B) LOCATION: 4123..5042

(ix) FEATURE:
(A) NAME/KEY: exon
(B) LOCATION: 5043..5270

(ix) FEATURE:
(A) NAME/KEY: intron
(B) LOCATION: 5271..5830

(ix) FEATURE:
(A) NAME/KEY: exon
(B) LOCATION: 5831..5944

(ix) FEATURE:
(A) NAME/KEY: intron
(B) LOCATION: 5945..6632

(ix) FEATURE:
(A) NAME/KEY: exon
(B) LOCATION: 6633..6758

(ix) FEATURE:
(A) NAME/KEY: intron
(B) LOCATION: 6759..6966

(ix) FEATURE:
(A) NAME/KEY: exon
(B) LOCATION: 6967..7252

(ix) FEATURE:
(A) NAME/KEY: intron
(B) LOCATION: 7253..7870

(ix) FEATURE:
(A) NAME/KEY: exon
(B) LOCATION: 7871..8102

(ix) FEATURE:
(A) NAME/KEY: 3'UTR
(B) LOCATION: 8103..8537

(ix) FEATURE:
(A) NAME/KEY: misc_RNA
(B) LOCATION: 8538..8878

(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: join(470..583, 3258..3449, 3939..4122,
5043..5270, 5831..5944, 6633..6758, 6967..7252,
7871..8102)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:

GAATTCATGC CCCTTTTGAA ATAGACTTAT GTCATTGTCA GAAAACATAA GCATTTATGG  60
TATATCATTA ATGAGTCACG ATTTTAGTGG TTGCCTTGTG AGTAGGTCAA ATTTACTAAG  120
CTTAGATTTG TTTTCTCACA TATTCTTTCG GAGCTTGTGT AGTTTCCACA TTAATTTACC  180
AGAAACAAGA TACACACTCT CTTTGAGGAG TGCCCTAACT TCCCATCATT TTGTCCAATT  240
AAATGAATTG AAGAAATTTA ATGTTTCTAA ACTAGACCAA CAAAGAATAA TAGTTGTATG  300
ACAAGTAAAT AAGCTTTGCT GGGAAGATGT TGCTTAAATG ATAAAATGGT TCAGCCAACA  360
AGTGAACCAA AAATTAAATA TTAACTAAGG AAAGGTAACC ATTTCTGAAG TCATTCCTAG  420

CAGAGGACTC AGATATATAT AGGATTGAAG ATCTCTCAGT TAAGTCTAC ATG AAA  475
                                                      Met Lys
                                                      1

AGG ATG GTT TCT TGG AGC TTC CAC AAA CTT AAA ACC ATG AAA CAT CTA  523
Arg Met Val Ser Trp Ser Phe His Lys Leu Lys Thr Met Lys His Leu
        5                   10                  15

TTA TTG CTA CTA TTG TGT GTT TTT CTA GTT AAG TCC CAA GGT GTC AAC  571
Leu Leu Leu Leu Leu Cys Val Phe Leu Val Lys Ser Gln Gly Val Asn
    20                  25                  30

GAC AAT GAG GAG GTGAATTTTT TAAAGCATTA TTATATTATT AGTAGTATTA  623
Asp Asn Glu Glu
35

TTAATATAAG ATGTAACATA ATCATATTAT GTGCTTATTT TAATGAAATT AGCATTGCTT  683
ATAGTTATGA AATGGAATTG TTAACCTCTG ACTTATTGTA TTTAAAGAAT GTTTCATAGT  743
ATTTCTTATA TAAAAACAAA GTAATTTCTT GTTTTCTAGT TTATCACCTT TGTTTTCTTA  803
AGATGAGGAT GGCTTAGCTA ATGTAAGATG TGTTTTTCTC ACTTGCTATT CTGAGTACTG  863
TGATTTTCAT TTACTTCTAG CAATACAGGA TTACAATTAA GAGGACAAGA TCTGAAAATC  923
TCACAAACTA TAAAATAATA AAAGAGCAGA ATTTTAAGAT AAAAGAAACT GGTGGTAGGT  983
AGATTGTTCT TTGGTGAAGG AAGGTAATAT ATATTGTTAC TGAGATTACT ATTTATAAAA  1043
ATTATAACTA AGCCTAAAAG CAAAATACAT CAAGTGTAAT GATAGAAAAT GAAATATTGC  1103
TTTTTTCAGA TGAAAAGTTC AAATTAGACT TAGTGTGTAT TGTTATTATT AATAGTTATG  1163
AAACACGGTT CAGTCTAATT TATTTATTTG TAGAACAGTT TGTCCTCAAC TATTATTTTT  1223
GCTGACTTAT TGCTGTTAAT TTGCAGTTAC TAAAAATACA GAAATGCATT TAGGACAATG  1283
GATATTTAAG AAATTTAAAT TTTATCATCA AACGTATCAT GGCCAAATTT CTTACATATA  1343
GCATAGTATC ATTAAACTAG AAATAAGAAT ACACAATAAT ATTTAAATGA AGTGATTCAT  1403
TTCGGATCAT TATTGAGTTT CAAGGGAACT TGAGTGTTGT ACTTATCAGA CTCTACATGT  1463
AAGAACATAT AGTTAATCTG GTTGTGTGTG TAAAAACATA TGGTTAATCT GGTTAAGTCT  1523
GGTTAATCAT ATTAGGTAAG AAAAATGTAA AGAATGTGTA AGACGAAATT TTTGTAAAGT  1583
ACTCTGCAAA GCACTTTCAC ATTTCTGCTT ATCAACTAAA CCTCACAGAG ATAGTTTAAT  1643
AGTTTAGGCT TTAAAATGGA TTTTGATTAT TCAACAAGTG GCCTTCATAA TTTCTTTAAG  1703
TGTTTTTCTT TAAGTATATA CTTTCTTTAA ATATTTTTTA AAATTTCCTT TTCTCTAGTA  1763
AAGCCAGACC ATCCATGCTA CCTCTCTAGT GGCACTCTGA AATAAAAAGA AAATAGTTTT  1823
CTCTGTTATA ATTGTATTTG TAATAAGCAG ATGAATCACA TTTCTTAAAA TTTGTTTTAG  1883
AGAGGGTAAG CTCTGACTAG GACCATGACT TCAATGTGAA ATATGTATAT ATCCTCCGAA  1943
TCTTTACATA TTAAGAATGT ATATAGTCAA CTGGTTAAAC AGGAAAATCT GGAACAGCCT  2003
GGCTGGGTTT TAATCTTAGC ACCATCCTAC TAAATGTTAA ATAATATTAT AATCTAATGA  2063
ATAAATGACA ATGCAATTCC AAATAGAGTT CATCTGATGA CTTCTAGACT CACAAAATTG  2123
CAAGAGAGCT CAGTTGTTGC TCAGTTGTTC CAAATCATGT CGTTTGTTAA TTTGTAATTA  2183
AGCTCCAAAG GATGTATAGC TACTGACAAA AAAAAAAATG AGAATGTAGT TAATCCAAAT  2243
CAAAACTTTC CTATTGCAAT GCGTATTTTC TGCTTCATTA TCCTTTAATA TAATATTTTA  2303
AGTTAGCAAG TAATTTTAAT TACAATGCAC AAGCCTTGAG AATTATTTTA AATATAAGAA  2363
AATCATAATG TTTGATAAAG AAATCATGTA AGAAATTTCA AGATAATGGT TTAACAAATA  2423
ATTTTGTTGA TAGAAGATAA GACTAAAAGT GAAATTCGAA GTGGAGAGGA CACTTAAACT  2483
GTAGTACTTG TTATGTGTGA TTCCAGTAAA AATAGTAATG AGCACTTATT ATTGCCAAGT  2543
ACTGTTCTGA GGGTACCATA TGCAATAAGT TATTTAATCC TTACAATAAT CTTGTAAGGC  2603
AGATTCAAAC TATCATTACA CTTATTTTAC AGATGAGAAA ACTGGGGCAC AGATAAAGCA  2663
ACTTGCCCAA GGTCTCATAG CTGTAAGTCA ACCCTACGGT CAAGACCTAC AAGTAGCCGA  2723
GCTCCAGAGT ACATTATGAG GGTCAAAGAT TGTCTTATTA CAAATAAATT CCAAGTAGAA  2783
TCAACCTTTA ATAAGTCTTT AATGTCTCTT AAATATGTTT ATATAGGAGT CTAATCACCA  2843
ATTCACAAAA ATGAAAGTAG GGAAATGATT AACAATAATC ATAGGAATCT AACAATCCAA  2903
GTGGCTTGAG AATATTCATT CTTCTTGACA GTATAGATTC TTTACAATTT CGTAAGTTCC  2963
AATGTATGTT TTAGGAATAT GAGGTCATTA CTATTCATAA TCTGATACAG CTTTATCCTA  3023
AGGCCTCTCT TTAAAAACTA CACTGCATCA TAGCTTTTTT GTGCAGTTGG TCTTTCTACT  3083




GTTACTGAAC AGTAAGCAAC CTACAGATTC ACTATCACCA ACCAGCCAGT TGATGGATCT  3143
TAAGCAAATT ATCAAGCTTG TGATAACCTA AATTATAAAA TGAGGGTGTT GGAATAGTTA  3203

CATTCCAAAT CTTCTATAAC ACTCTGTATT ATATTTCTGC CTCATTCCTT GTAG GGT  3260
                                                            Gly

TTC TTC AGT GCC CGT GGT CAT CGA CCC CTT GAC AAG AAG AGA GAA GAG  3308
Phe Phe Ser Ala Arg Gly His Arg Pro Leu Asp Lys Lys Arg Glu Glu
40                  45                  50                  55

GCT CCC AGC CTG AGG CCT GCC CCA CCG CCC ATC AGT GGA GGT GGC TAT  3356
Ala Pro Ser Leu Arg Pro Ala Pro Pro Pro Ile Ser Gly Gly Gly Tyr
                60                  65                  70

CGG GCT CGT CCA GCC AAA GCA GCT GCC ACT CAA AAG AAA GTA GAA AGA  3404
Arg Ala Arg Pro Ala Lys Ala Ala Ala Thr Gln Lys Lys Val Glu Arg
            75                  80                  85

AAA GCC CCT GAT GCT GGA GGC TGT CTT CAC GCT GAC CCA GAC CTG  3449
Lys Ala Pro Asp Ala Gly Gly Cys Leu His Ala Asp Pro Asp Leu
        90                  95                  100

GTGGGTGCAC TGATGTTTCT TGCAGTGGTG GCTCTCTCAT GCAGAGAAAG CCTGTAGTCA  3509
TGGCAGTCTG CTAATGTTTC ACTGACCCAC ATTACCATCA CTGTTATTTT GTTTGTTTAT  3569
TTTGGAAATA AAATTCAAAA CATAAACATA TTGGGCCTTT GGTTTAGGCT TTCTTTCTTG  3629
TTTTCTTTGG TCTGGGCCCA AAATTTCAAA TTAGGATATG TGGGTGCCAC CTTTCCATTT  3689
GTATTTTGCC ACTGCCTTTG TTTAGTTGGT AAAATTTTCA TAGCCCAATT ATATTTTTTC  3749
TGGGGTAAGT AATATTTTAA ATCTCTATGA GAGTATGATG ATGACTTTCG AATTTCTGGT  3809
CTTACAGAAA ACCAAATAAT AAATTTTTAT GTTGGCTAAT CGTATCGCTG AATTTTCCTA  3869
TGTGCTATTT TAACAAATGT CCATGACCCA AATCCTTCAT CTAATGCCTG CTATTTTCTT  3929

TGTTTTTAG GGG GTG TTG TGT CCT ACA GGA TGT CAG TTG CAA GAG GCT  3977
          Gly Val Leu Cys Pro Thr Gly Cys Gln Leu Gln Glu Ala
                  105                 110                 115

TTG CTA CAA CAG GAA AGG CCA ATC AGA AAT AGT GTT GAT GAG TTA AAT  4025
Leu Leu Gln Gln Glu Arg Pro Ile Arg Asn Ser Val Asp Glu Leu Asn
                120                 125                 130

AAC AAT GTG GAA GCT GTT TCC CAG ACC TCC TCT TCT TCC TTT CAG TAC  4073
Asn Asn Val Glu Ala Val Ser Gln Thr Ser Ser Ser Ser Phe Gln Tyr
            135                 140                 145

ATG TAT TTG CTG AAA GAC CTG TGG CAA AAG AGG CAG AAG CAA GTA AAA G  4122
Met Tyr Leu Leu Lys Asp Leu Trp Gln Lys Arg Gln Lys Gln Val Lys
        150                 155                 160

GTAGATATCC TTGTGCTTTC CATTCGATTT TCAGCTATAA AATTGGAACC GTTAGACTGC  4182
CACGAGAATG CATGGTTGTG AGAAGATTAA CATTTCTGGG TTAGTGAATA GCATTCATAC  4242
GCTTTTGGGC ACCTTCCCCT GCAACTTGCC AGATAAGCAC TATTCAGCTC TTATTCCCAG  4302
TCTGACATCA GCAAGTGTGA TTTTCTATGA AAAATTCTAC TATGACTCCT TATTTTAAGT  4362
ATACAAGAAA CTTGTGACTC AGAAGATAAT ATTTACAGAG TGGAAAAAAA CCCCTAGCAT  4422
TTATAGTTTT AACATTTGAG GTTTTGAATG AGAGAGTTAT CCATAATATA TTCAATTGTG  4482
TTGTGGATAA TGACACCTAA CCTGTGAATC TTGAGGTCAG AATGTTGAGT GCTGTTGACT  4542
TGGTGGTCAG GAAACAGCTA GTGCGTGAGC CTGGCACAGG CATCTCAGTG AGTAGCATAC  4602
CCACAGTTGG AAATTTTTCA AAGAAATCAA AGGAATCATG ACATCTTATA AATTTCAAGG  4662
TTCTGCTATA CTTATGTGAA ATGGATAAAT AAATCAAGCA TATCCACTCT GTAAGATTGA  4722
ACTTCTCAGA TGGAAGACCC CAATACTGCT TTCTCCTCTT TTCCCTCACC AAAGAAATAA  4782
ACAACCTATT TCATTTATTA CTGGACACAA TCTTTAGCGT ATACCTATGG TAAATTACTA  4842
GTATGGTGGT TAGGATTTAT GTTAATTTGT ATATGTCATG CGCCAAATCA TTTCCACTAA  4902

ATATGACTAT ATATCATAAC TGCTTGGTGA TAGCTCAGTG TTTAATAGTT TATTCTCAGA  4962
AAATCAAAAT TGTATAGTTA AATACATTAG TTTTATGAGG CAAAAATGCT AACTATTTCT  5022

ACATAATTTC ATTTTTCCAG  AT AAT GAA AAT GTA GTC AAT GAG TAC TCC  5071
                      Asp Asn Glu Asn Val Val Asn Glu Tyr Ser
                          165                 170

TCA GAA CTG GAA AAG CAC CAA TTA TAT ATA GAT GAG ACT GTG AAT AGC  5119
Ser Glu Leu Glu Lys His Gln Leu Tyr Ile Asp Glu Thr Val Asn Ser
    175                 180                 185

AAT ATC CCA ACT AAC CTT CGT GTG CTT CGT TCA ATC CTG GAA AAC CTG  5167
Asn Ile Pro Thr Asn Leu Arg Val Leu Arg Ser Ile Leu Glu Asn Leu
190                 195                 200                 205

AGA AGC AAA ATA CAA AAG TTA GAA TCT GAT GTC TCA GCT CAA ATG GAA  5215
Arg Ser Lys Ile Gln Lys Leu Glu Ser Asp Val Ser Ala Gln Met Glu
                210                 215                 220

TAT TGT CGC ACC CCA TGC ACT GTC AGT TGC AAT ATT CCT GTG GTG TCT  5263
Tyr Cys Arg Thr Pro Cys Thr Val Ser Cys Asn Ile Pro Val Val Ser
            225                 230                 235

GGC AAA G GTAACTGATT CATAAACATA TTTTTAGAGA GTTCCAGAAG AACTCACACA  5320
Gly Lys

CCAAAAATAA GAGAACAACA ACAACAACAA AAATGCTAAG TGGATTTTCC CAACAGATCA  5380
TAATGACATT ACAGTACATC ATAAAAATAT CCTTAGCCAG TTGTGTTTTG GACTGGCCTG  5440
GTGCATTTGC TGGTTTTGAT GAGCAGGATG GGGCACAGGT AGTCCCAGGG GTGGCTGATG  5500
TGTGCATCTG CGTACTGGCT TGAACAGATG GCAGAACCAC AGATAGATGT AGAAGTTTCT  5560
CCATTTTGTG TGTTCTGGGA GCTCATGGAT ATTCCAGGAC ACAAAAGGTG GAGAAGAGCT  5620
TTGTTCATCC TCTTAGCAGA TAAACGTCCT CAAAACTGGG TTGGACTTAC TAAAGTAAAA  5680
TGAAAATCTA ATATTTGTTA TATTATTTTC AAAGGTCTAT AATAACACAC TCCTTAGTAA  5740
CTTATGTAAT GTTATTTTAA AGAATTGGTG ACTAAATACA AAGTAATTAT GTCATAAACC  5800

CCTGAACATA ATGTTGTCTT ACATTTGCAG  AA TGT GAG GAA ATT ATC AGG AAA  5853
                                 Glu Cys Glu Glu Ile Ile Arg Lys
                                 240                 245

GGA GGT GAA ACA TCT GAA ATG TAT CTC ATT CAA CCT GAC AGT TCT GTC  5901
Gly Gly Glu Thr Ser Glu Met Tyr Leu Ile Gln Pro Asp Ser Ser Val
        250                 255                 260

AAA CCG TAT AGA GTA TAC TGT GAC ATG AAT ACA GAA AAT GGA G  5944
Lys Pro Tyr Arg Val Tyr Cys Asp Met Asn Thr Glu Asn Gly
    265                 270                 275

GTAAGCTTTC GACAGTTGTT GACCTGTTGA TCTGTAATTA TTTGGATACC GTAAAATGCC  6004
AGGAAACAAG GCCAGGTGTG GTGGCTCATA CCTGTAATTC CAGCACCTTG GGAGGCCAAA  6064
GTGGGCTGAT AGCTTGAGCC TAGGAGTTTG AAACTAGCCT GGGCAACATA ATGAGACCCT  6124
AACTCTACAA AAAAAAAAAA AATACCAAAA AAAAAAAAAA AATCAGCTGT GTTGGTAGTA  6184
TGTGCCTGTA GTCCCAGCTA TCCAGGAGGC TGAGATGGGA GATCACCTGA GCCCACAACC  6244
TGGAGTCTTG ATCATGCTAC TGAACTGTAG CCTGGGCAAC AGAGGATAGT GAGATCCTGT  6304
CTCAAAAAAA AAAATTAATT AAAAAGCCAG GAAACAAGAC TTAGCTCTAA CATCTAACAT  6364
AGCTGACAAA GGAGTAATTT GATGTGGAAT TCAACCTGAT ATTTAAAAGT TATAAAATAT  6424
CTATAATTCA CAATTTGGGG TAAGATAAAG CACTTGCAGT TTCCAAAGAT TTTACAAGTT  6484
TACCTCTCAT ATTTATTTCC TTATTGTGTC TATTTTAGAG CACCAAATAT ATACTAAATG  6544
GAATGGACAG GGGATTCAGA TATTATTTTC AAAGTGACAT TATTTGCTGT TGGTTAATAT  6604



ATGCTCTTTT TGTTTCTGTC AACCAAAO OA TOO ACA OTO ATT CAO AAC COT 



««55 



Oly Tip Thi Vil IU Ols Asa Atg 



2 SO 



2IS 



CAA OAC GOT AOT OTT OAC TTT OOC AOO AAA TOO OAT CCA TAT AAA CAO 
Ola A*p Oly Set V* I Asp Phi Oly Ai| Lys Tip Asp Fro Tyi Ly i Ola 

290 295 300 

OOA TTT OOA AAT OTT OCA ACC AAC ACA OAT GOO A AG AAT TAC TOT OOC 
Oly Pbe Oly Asa V* 1 Ala Tit Asa Tfar Asp Oly Lys Asa Tyr Cy» Oly 
305 310 315 

CTA CCA O OTAACOAACA OOCATOCAAA ATAAAATCAT TCT ATTTGAA ATOOOATTTT 

Lea P i o 

TTTTAATTAA AAAAC ATTCA TTGTTGOAAO CCTOTTTTAO OCAOTTAAOA OOAOTTTCCT 

GACAAAAATO TOOAAGCTAA AOATAAGGGA AGAAAOOCAO TTTTTAOTTT CCCAAAATTT 

TATTTTTOOT OAOAOATTTT ATTTTOTTTT TCTTTT AO OT GAA TAT TOO CTT 

Oly Ola Tyr Tip Las 

320 

OOA AAT OAT AAA ATT AOC CAO CTT ACC AOO ATO OOA CCC ACA OA A CTT 
Oly Asa Asp Lys lie Sti Ola Leu Thr Art Mat Oly Pro Tar Ola Lea 



3 25 



3 3 0 



335 



3 40 



TTO ATA GAA ATG OAO OAC TOG AAA OOA OAC AAA OTA A AO OCT CAC TAT 
Leo lie Gla Met Ola Asp Tip Lys Oly Asp Lys Vil Lys Ala His Tyr 



345 



3 5 0 



355 



GO A OOA TTC ACT OTA CAO AAT OA A OCC AAC AAA TAC CAO AT C TCA OTO 
Oly Oly Pbe Thr V* 1 Ola Asa Ols Ala Asa Lys Tyr Ola lie Set Val 



3 65 



3 70 



AAC AAA TAC AO A OOA ACA OCC OOT AAT OCC CTC ATO OAT OOA OCA TCT 
Asa Lys Tyr Art Oly Thr Ala Oly Asa Ala Lao Mat Asp Oly Ala Ser 



375 



3 t 0 



3*5 



CAO CTO ATO OOA OA A AAC AOO ACC ATO ACC ATT CAC AAC OOC ATO TTC 
Ola Lea Met Oly Ola Asa Art Thr Met Thr lie His Asa Oly Mat Pbe 



3 9 0 



3 9 3 



TTC AOC ACQ TAT G AC AO A OAC AAT OAC OOC 
Pbe Ser T b r Tyr Asp Atg Asp Asa Asp Oly 



40 5 



4 1 0 



4 00 

TO O T A TO TO TOO 
Tr p 
41 3 



CACTCTTTOC TCCTOCTTTA AAAATCACAC TAATATCATT ACTCAOAATC ATTAACAATA 

TTTTTAATAO CTACCACTTC CTOOGCACTT ACTOTCAOCC ACTOTCCTAA OCTCTTTATO 

CATCACTCOA AAGC ATTTC A ACT AT AAOOT AOACATTCTT ATTCTCATTT TACAOATG AG 

ATTTAOAGAO ATTACGTOAT TTOTCCAATO TCACACAACT ACCCAGAOAT AAA ACT AO AA 

TTTOAOCACA OTT ACTTTCT OAATAATOAG CATTTAOATA AATACCTATA TCTCTATATT 

CTAAAOTOTO TOTOAAAACT TTCATTTTCA TTTCCAOOGT TCTCTOATAC TAAOOGTTOT 

AAAAOCTATT AT TCC AOT AT AAAGTAACAA ACACAOTCCC TAGATOOATT OCCACAAAOO 

CCCAOTTATC TCTCTTTCTT OCTATAOOOC ACAOOAOOTC T TTOGTOT AT TAOTOTOACT 

CTATOTATAO CACCCAAAGO AAAOACTACT OTOCACACOA OTOTAOCAOT CTTTTATOOO 

TAATCTOCAA AACGTAACTT OACCACCOTA OTTCTOTTTC TAATAACOCC AAA CAC AT TT 

TCTTTCAO O TT A ACA TCA OA T CCC AOA AAA CAO TOT TCT AAA GAA OAC 

Lea Tbr Ser Asp Pre Art Lyi Ola Cys Ser Lys Ola Asp 

4 2 0 425 

OOT OOT OOA TOG TOO TAT AAT AOA TOT CAT OCA OCC AAT CCA AAC OOC 

Oly Oly Oly Trp Trp Tyr Asa Arg Cys His Ala Ala Asa Pro Asa Oly 

430 435 440 

AOA TAC TAC TOG OOT OOA CAO TAC ACC TOG OAC ATO OCA AAO CAT OOC 

Arg Tyr Tyr Trp Oly Oly Ola Tyr Thr Trp Asp Met Ala Lys His Oly 



6703 
«75 1 

otOI 

6 16* 
692 * 
69*0 

70 2* 

70 7 6 

7 12 4 



72 2 0 
7 2 62 

7 3 2 2 

73 t 2 
7 442 
7502 
75 62 
7622 
76*2 
7742 
7102 
7*62 
7 9 10 



$006 




445 430 435 *«° 

ACA OAT OAT OCT OTA OTA TOO ATO AAT TOO AAO OOO TCA TOO TAC TCA *0 3 4 

Thi Aip A.p Oly Val Val Trp Mi t Aia Trp Ly» Oly Sir Try Tyr Sir 

465 470 4 ?S 

ATO AOO AAO ATO AOT ATO AAO ATC AOO CCC TTC TIC CCA CAO CAA TAOTCCCCIU* 

Met Arg *it Sar Mat Lyt Ut Art P * • **• Pr0 ol * ol * 

4*o 415 490 



TACOTAOATT 


TTTOCTCTTC 


TOTATOTOAC 


AACATTTTTO 


TAC ATT ATO T 


TATTOO A ATT 


t 1 6 9 


TTTTTTPiTi 


TATTATATTC 


CTCT AAA ACT 


CTCAAOCAOA 


COTOAOTOTO 


ACTTTTTOAA 


• 2 2 9 


A A AAO TAT AG 


OAT A A AT TAC 


ATTAAAATAO 


CACATOATTT 


TCTTTTOTTT 


TCTTC ATTTC 


t2t9 


TCTTOCTCAC 


CCAAOAAOTA 


ACAAAAOTAT 


AOTTTTO AC A 


OAOTTOOTOT 


TCATAATTTC 


• 3 49 


AOTTCT AOTT 


OATTOCOAOA 


ATTTTCAAAT 


AAOOAAOAOO 


OGTCTTTTAT 


CCTTOTCOTA 


ft 40 9 


OOAAAACCAT 


OACOOAAAOO 


AAAAACTGAT 


OTTTAAAAOT 


CCACTTTTAA 


AACTAT ATTT 


I4«9 


ATTTATOTAO 


OATCTOTCAA 


AO AAAAC TTC 


CAAAAAOATT 


TATTAATTAA 


ACCAOACTCT 


1529 


OTTOCAATAA 


OTT AATOTTT 


TCTTOTTTTO 


TAATCCACAC 


AT TCA ATO AO 


TTAOOC TTTO 


ft 3 S 9 


CACTTOTAAO 


OAAOOAOAAO 


COTTCACAAC 


CTCAAATAOC 


T AAT A A AC CO 


OTCTTO AAT A 


1649 


TTTOAAOATT 


TAAAATCTOA 


CTCTAOOACO 


OOCACOOTOO 


CTCACOACTA 


TAATCCCAAC 


• 709 


ACTTTOOOAO 


QCTQAOOCQQ 


OCOOTCACAA 


OOTCAOOAOT 


TCAAOACCAO 


CCTOACCAAT 


• 769 


ATOOTOAAAC 


CCCATCTCTA 


CTA AAAAT AC 


AAAAATTAOC 


CAOOCOTOOT 


OOCAOOTOCC 


• S 29 


TOTAOOTCCC 


AOCTAOCCTO 


TOAOOTOOAO 


ATTOC ATTOA 


OCCAAOATC 




S S 7 « 



(2) INFORMATION FOR SEQ ID NO:4:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 491 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:

Mat L T i Art Mat Val Sar Trp Sar Pha Hit Lyi Lio Lyi Thr Mat Ly» 

,5 10 13 

B i » 



L,b Lao Lao Lao Lao Lao Cyi Val Pha Lao Val Lyi Sar Ola Oly 

20 2 3 3 0 



V.l Aia Asp Asa Olt Oil Oly Faa Paa Sar Ala Arg Oly Bit Art P*© 

35 4 0 4 5 

Lao Aip Lyi Lyi Art Oil Olo Ala Fro Sar Lao Art Fro Ala Fro Pro 

5 0 3 3 6 0 

Pro 11a Sar Oly Oly Oly Tyr Art All Art **• Ala Lyi Ala Ala Ala 

€5 70 75 SO 

Thr Ola Lyi Lyi Val Olo Art Lyi Ala Pro A«p Ala Oly Oly Cyt Lao 

1 5 9 0 9 5 

Hit Ala Aip Pro Aip Lao Oly Val Lao Cyi Pro Tar Oly Cya Ola Lao 

10 0 10 5 110 

Ola Olo Ala Lao Lao Ola Ola Ola Art Pro 11a Art Aia Sar Val Aip 

115 120 123 

Olo L«o A»a An Aia Val Olo Ala Val Sar Oli Thr Sor Sar Sir Sir 

130 13 5 140 

Phe Ola Tyr Mat Tyr Lao Lai Lyi Aip Lai Trp Ola Lyi Art Ola Lya 



1 45 



1 60 



Ola Val Lyi Aip Aia Olo Aia Val Val Aia Ola Tyr Sir Sir Olo Lao 

163 170 175 
O 1 o



7b r 



I 1 c 



Tbr 
2 2 5 

Cys 



I 1 « 



At a 

3 0 5 



L y • Bit Ola Lea 
1 SO 

Asa Leo Ai| Va 1 

195 

Oia Ly t Lev Ola 

2 10 

Pro Cyi Tbr Va 1 



Gin Glo lie lie 
245 

Ola Pro Asp Ser 
2 6 0 

Tbr Glo Asa Oly 

275 

Val Asp Pbe Oly 

2 9 0 

Val Ala Tbr Asa 



Glu Tyr Trp Leo 



Pro Tbr Glo 



Leo 

3 40 



L y s Ala 



h i • 

355 



O 1 a 



Ai p 

5 8 5 



I 1 e 
3 7 0 



Tbr Sex Asp 



Trp Tyr 



A s a 

43 5 



o i y 



O 1 y 
4 5 0 



V* 1 
4 6 3 



Ol y 
3 2 5 



Tyr II 

Leo Ar 

Ser As 

2 1 

Ser C y 

23 0 

Ar| Ly 

Ser Va 

Oly Tr 



Arg Ly 

2 9 

Tbr As 

3 1 0 

Asa As 



Leo I 1 « Ol 



Tyr Oly Oly Pb 



Ser Val Asa 



Oly Ala Ser Ola 

Oly Met Pbe 



P r o 
4 2 0 



Arg 



Pbe 
4 0 5 

All 

Cy s 



Ola Tyr Tbr 



Val Trp Met Asa 

Scr Met Ly i lie Art 
4*3 



Ly • Ty 

3 7 

Leo Me 

3 9 0 

Ser Tb 



Ly s 0 1 



His A 1 



Trp As 
4 5 

Trp Ly 
4 7 0 

Pro Pb 



Asp 



Ser 

20 0 



O I o 

115 



Tbr Val Asa 



lie Leo Olo Asa 



Val Ser Ala Ola 



Asa lie Pro 



Va 1 

235 



Oly 

Ly s 

Tbr 
2*0 

Tip 
Oly 



Oly 



Pro 
265 



Olo 
2 5 0 



Asp 
Ly s 



Pro Tyr 



Tyr 
3 1 5 



> e r 

3 3 0 



Me t 

Tbr 
360 

Ar * 

Oly 
Tyr 
Cy s 



A 1 a 
4 4 0 



O 1 o 
3 45 



Oly Tbr Ala 



Olo A s a 



Ar g 

3 95 



Asp 



S « r 
4 2 5 



Ar * 

4 1 0 



Met Ala 



Oly Ser 



Pbe Pro 



Ly s 
Trp 



O I a 
49 0 



Me t 

2 2 0 



Val 



Tbr Ser 



Tyr Arg Val 



Val lie Ola Asa 



Ly s 
3 0 0 

Cy s 



Lys lie Ser Ola Leo 



Asp Trp Lys 



Val Ola Asa Olo 



Oly 
3*0 

Tbr 



Asp Asa 



Lys Olo Asp 

Oly 



Asa Pro Asa 



Tyr 
4 7 5 

O 1 a 



Oly 
4 6 0 



Ser 



Ser Asa lie Pro 
190 

Leo Arg Ser Lys 

205 

Olo Tyr Cys Arg 



Ser Oly Lys Olo 
240 

Olo Met Tyx Leo 

255 

Tyt Cys Asp Met 

270 

Arg Ola Asp Oly 
2*5 

Ola Oly Pbe Oly 



Oly Leo Pro Oly 
3 2 0 

Tbr Arg Me t Oly 

3 35 

Oly Asp Lys Val 
350 

Ala Asa Lys Tyr 

365 

Asa Ala L • o Me t 



Met Tbr lie His 
400 

Asp Oly Trp Leo 
4 1 5 

Oly Oly Oly Trp 
43 0 

Arg Tyr Tyr Trp 
4 45 

Tbr Asp Asp Oly 



Met Arg Lys Met 
4*0 



(2) INFORMATION FOR SEQ ID NO:5:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10564 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS:
(D) TOPOLOGY:

(ii) MOLECULE TYPE: DNA (genomic)

(vii) IMMEDIATE SOURCE:
(B) CLONE: boa



(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: join(1799..1876, 1973..2017, 2207..2390, 2510
..2603, 4211..4341, 4645..4778, 5758..5942, 7426
..7703, 9342..9571)
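The CDS feature above lists nine exon intervals. As a rough consistency check, the interval lengths can be summed and compared with the 453-residue length stated for the encoded protein. The sketch below is illustrative only: the coordinate string is a reading of the scanned LOCATION entry (an assumption, since several digits are damaged in the scan), and `parse_join` and `coding_length` are hypothetical helper names, not part of the listing.

```python
import re

# Assumed reading of the scanned LOCATION entry; damaged digits were
# inferred from the position numbers printed beside the sequence rows.
LOCATION = (
    "join(1799..1876, 1973..2017, 2207..2390, 2510..2603, 4211..4341, "
    "4645..4778, 5758..5942, 7426..7703, 9342..9571)"
)

def parse_join(location: str) -> list[tuple[int, int]]:
    """Return the (start, end) pairs found in a join(a..b, c..d, ...) string."""
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]

def coding_length(exons: list[tuple[int, int]]) -> int:
    """Total number of bases covered by inclusive, 1-based intervals."""
    return sum(end - start + 1 for start, end in exons)

exons = parse_join(LOCATION)
total = coding_length(exons)
codons, leftover = divmod(total, 3)
print(f"{len(exons)} exons, {total} coding bases, {codons} codons, {leftover} left over")
```

Under this reading the intervals cover a whole number of codons that matches the stated protein length, which is some evidence that the damaged digits were read correctly; it is not a substitute for checking against the original printed listing.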
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:



CTACACACTT 


CTTOAAOOCA 


AAGOCAATGC 


TOAAOTCACC 


TTTCATOTTC 


A AAT CAT ATT 


6 0 


AAAAAOTTAO 


CAAOATOTAA 


TTATC AOTOT 


ACT ATOT AA A 


TCTTTOTOAA 


TOATCAATAA 


130 


TTACATATTT 


TC ATT AT AT A 


TATTTTAOTA 


OAT AAT ATT T 


AT AT AC ATT C 


AAC ATTCT A A 


ISO 


AT AT AO A A AO 


TTT AC AO AO A 


AAAATAAAOC 


CTTTTTTTCC 


AATCCTOTCC 


TCCACCTCTO 


3 4 0 


CATCCCATTC 


TTCTTCACAO 


AOOCAACTOA 


TTC AAOTC AT 


TAC ATAOTT A 


TTOAOTOTTA 


9 0 0 


ACT ACAACTA 


TOTTAAOTAC 


AGCTATATAT 


OTTAOATOCC 


GTAOCCACAO 


AAATCAGTTT 


3 60 


AC AAT CT AAT 


OCAOTOOATA 


CAOCATOTAT 


ACAT AT AAT A 


TAAOOTTOCT 


ACAAATGCTA 


4 30 


TCTOAOOTAO 


AOCTOTTTOA 


AAOAAT ACTA 


ATACTTAAAT 


OTTT AATTCA 


ACTOACTTOA 


4 • 0 


TTGACAACTG 


ATT AOCTOAO 


TOGA A A AO AT 


GGATGAGA A A 


OAT TO TO AO A 


CTTAATTOOC 


5 4 0 


TOOTOOTATO 


OTOATATOAT 


TGACAATAAC 


TGCTAAOTCA 


GAG AGGG AT A 


T ATT A AGO AG 


6 0 0 


OAOAAOAAAA 


OCAACAAATC 


TOOTTTTOAT 


CTGTTCACTT 


TOTTATAATT 


ATTOATTATT 


660 


TACTGAATAT 


OAATATTTAT 


CTTTOTTTTT 


G AOT C AAT A A 


AT AT AC CTT T 


GTA A AG AC AG 


7 3 0 


A AT T A A AO T A 


TTAOTATTTC 


TTTCAAACTG 


OAOOCATTTC 


TCCCACTAAC 


ATATTTCATC 


710 


AAAACTTATA 


ATAAOCTTOO 


TTCCAGAGGA 


AOAAATOAOG 


GAT A ACC AAA 


AAT AG AO AC A 


• 40 


TTAATAATAO 


TOTAACOCCC 


AOTOATAAAT 


CTCAATAOOC 


AOTO ATOAC A 


OACATOTTTT 


9 00 


CCCAAACAC A 


AOOATOCTOT 


AAOOOCCAAA 


CAOAAATOAT 


OOCCCCTCCC 


CAOCACCTCA 


960 


TTTTOCCCCT 


TCCTTCAOCT 


ATOCCTCTAC 


TCTCCTTTAG 


AT AC AAOOO A 


GGTGOATTTT 


10 3 0 


T CT CTTCTCT 


GAOATAOCTT 


OATOOAACCA 


CAOOAACAAT 


OAAOTGOOCT 


CCTOOCTCTT 


1010 


TT CTCTOTOO 


CAOATOOOOT 


OCCATOCCCA 


CCTTCAOACA 


AAOOO A AO AT 


TOAOCT C AAA 


114 0 


AOCTCCCTOA 


OAAOTOAOAO 


CCTATOAACA 


TOOTTOACAC 


AOAOOOACAO 


OAATOTATTT 


1300 


CCAOOGTCAT 


TCATTCCTOO 


OAATAOTOAA 


CTOOGACATO 


OOGOAAOTCA 


OTCTCCTCCT 


1360 


OCCACAOCC A 


CAOATTAAAA 


ATAATAATOT 


TAACTOATCC 


CTAOOCTAA A 


AT AATAOTGT 


13 30 


TAACTOATCC 


CT AAOCTAAO 


AAAGTTCTTT 


TGGTAATTCA 


OGTGATGOCA 


GCAOGACCCA 


1310 


TCTT AAOOAT 


AOACTAOOTT 


TGCTTAOTTC 


OAOOTCATAT 


CTOTTTOCTC 


TCAOCCATOT 


14 4 0 


ACTOOAAOA A 


OTTGCATCAC 


ACAOCCTCCA 


OOACTOCCCT 


CCTCCTCACA 


OCAATOOATA 


15 0 0 


ATOCTTCACT 


AOCCTTTOCA 


OATAATTTTO 


GATCAGAOAA 


AAAACCTTOA 


OCTOOOCCAA 


15 60 


AAAOOAOOAO 


CTTCAACCTO 


TOTOCAAAAT 


CTGOGAACCT 


O AC AGT AT AO 


OTTOOOGOCC 


16 3 0 


AOOATOAOOA 


AAAAGOAACO 


OGAAAGACCT 


OCCCACCCTT 


CTOOTAAOOA 


OOCCCCOTOA 


1610 


TCAOCTCCAO 


CCATTTOCAO 


TCCTOOCT AT 


CCCAOOAOCT 


T AC AT A AAOO 


OAC A AT TOO A 


17 4 0 


OCCTOAOAOO 


TOACAOTOCT 


OACACTACAA 


OOCTCGGAGC 


TCCOOOCACT 


CAOACATC 


1 ? O ft 

i / y e 


ATG AOT TOO TCC TTO CAC CCC COO 
Met S*r Trp Set Leo His Pt© Af | 

1 5 


AAT TTA ATT 
Asa Lib 11c 

1 0 


CTC TAC TTC TAT OCT 
Lis Tyr Pbe Tyr Ala 

I 5 


1 S 46 


CTT TTA TTT CTC TCT TCA ACA TOT 
Leo Lev Pfae Lev Ser Sit Tbt Cys 

2 0 


GTA OCA GTAAOTOTOC TCTTCACAAA 
Va 1 Ala 

2 5 


1 « 9 6 


ACOTTOTTTA 


AAATOOAAAO 


CTOOAAAATA 


AAACAOATAA 


TAAACTAOTO 


AAATTTTCOT 


19 5 6 


ATTTTTTCTC 


TT T T AG TAT 


OTT OCT ACC AG A OAC AAC TOC TOC 


AT C TTA 


3005 




Tyr 


Vil Ala Thr Atg Asp As* Cys Cys 

3 0 35 


lie Lev 





OAT OA A AO A TTC OTAAOTAOTT TTT ATOTTTC TCCCTTTOTO TOTOAACTOO 3057 
Asp Glv At | Pbe 
40

AOAOOOOCAO AGO A AT AO A A AT AATTCCCT CAT AAAT ATC ATCTOOCACT TGTAACTTTT 2117 

TA A A A A CAT A OTCT AOOTTT TACCTATTTT TCTTAAT AOA TTTT AAOAOT AOCATCTOTC 2 177 

TACATTTTTA ATC ACTGTTA TATTTTCAG GOT AGT TAT TOT CCA ACT ACC TOT 2230 

Oly Ser Ty r Cy i Pro Tb i Thr Cys 
4 5 

GGC ATT OCA OAT TTC CTO TCT ACT TAT CAA ACC AAA OTA OAC AAG OAT 22 7 t 

Oly lie Alt A«p Phe Leo S«r Thr Tyr Ola Tbr Ly i Val Asp Lyt Atp 

50 55 60 6 5 

CT A CAO TCT TTO OA A OAC ATC TTA CAT CAA OTT OA A AAC AAA ACA T CA 23 2 6 

Lev Gin Ser Lev Ol Q Asp 11* Leo Hi* Ol I Val Olv Asa Ly I Tkr Ser 

7 0 7 5 SO 

OA A GTC AAA CAO CTO ATA AAA OCA ATC CAA CTC ACT TAT AAT CCT OAT 2374 

OId Val Ly i Ola Lev lie Lyt Ala lie Ola Lev Tbr Tyr Asa Pro Atp 

ft 5 90 9 5 

OA A T C A TCA AAA CCA A GTGAGAAAAT AAAOACTACT O AC C AAAA A A 24 20 

OId Ser Ser Ly i Pro 

1 0 0 

TAATAATAAT AATCTOTOAA OTTCTTTTOC TOTTOTTTTA OTTOTTCTAT TTOCTTAAOO 2410 

ATTTTTATOT CTCTOATCCT A T ATT A CAO AT ATO AT A OAC OCT OCT ACT TTO 2 5 3 2 

Asa Met lie Asp Ala Ala Tbr Lev 

10 5 110 

A AO TCC AGO ATA ATO TTA OA A OA A ATT ATO AAA TAT OAA OCA TCO ATT 2 5 8 0 

Ly* Ser Arg lie Met Lev Gin Glo lie Met Lys Tyr Olv Ala Ser lie 

115 12 0 125 

TTA ACA CAT OAC TCA AOT ATT CO OTAAGOATTT TTOTTTTAAT TTOCTCTOCA 2633 
Lev Tbr Bis Asp Ser Ser lie Art 

1 3 0 

AGACTOATTT AOTTTTTATT TAATATTCTA TACTTOAOTO A A AO T AATTT TTAATOTOTT 269 3 

TTCCCCATTT A T AAT A T C C C AOTOACATTA TOCCTOATTA TOT TO AO CAT AOT AO AO AT A 2 75 3 

GAAOTTTTTA OTOCAATATA AATTATACTO OOTTATAATT OCTTATTAAT AATCACATTG 2»13 

AAOAAAOATO TTCTAGATOT CTTCAAATOC TAOTTTOACC ATATTTATCA AAAATTTTTT 2173 

CCCCATCCCC CATTTATCTT ACAACAT A AA ATCAATCTCA TAGOAATTTO OGT0TTGAAA 2933 

ATAAAATCCT CTTT AT AAAA ATGCTOACAA ATTOOTOOTT AAAAAAATTA OCAAOCAOAO 2993 

OCATAOTAAO OATTTTOGCT CCTAAAOTAA ATTATATTOA ATGTGOAOCA OGAAGAAACA 305 3 

TGTCTIOAOA GACTAAOTGT OOCAAATATT OCAAAOCTCA TATTOATCAT TGCAOAATOA 3 113 

ACCTOCATAG TCTCTTCCCT T CATTTOOA A OTOAATGTCT CTGTTAAAGC TTCTCAOOOA 3 173 

CTCATAAACT TTCTOAACAT AAOOTCTCAO ATACAGTTTT AATATTTTTC CCCAATTTTT 3233 

TTTTCTOAAT TTTTCTCAAA GCAOCTTOAO AAATTOAOAT AAATAOTAOC TAOOOAOAAO 3293 

TOOCCCAGOA AAGATTTCTC CTCTTTTTOC TATCAOAOOO CCCTTOTTAT TATTOTTATT 3353 

ATTATTACTT OCATTATTAT TOTCCATCAT TOAAOTTOAA OOAGOTTATT GTACAOAAAT 34 13 

TOCCTAAOAC AAOOTAOAOO OAAAACOTOG ACAAATAOTT TOTCTACCCT TTTTTACTTC 3473 

AAAOAAAGAA CGOTTTATGC ATTGTAGACA GTTTTCTATC ATTTTTOOAT ATTTOCAAOC 3 533 

CACCCTOTAA OTAACTACAA AAOOAOOOTT TTT ACTTCCC CCAOTCCATT CCCAAAGCTA 3593 

TOTAACCAOA AOCATTAAAG AAOAAAOOOG AAOTATCTGT TOTTTTATTT TACAT ACA AT 3653 

AACGTTCCAG ATCATOTCCC TOTOTAAOTT ATATTTTAOA TTGAAGCTTA TATOT AT AGC 37 13 

CTCAGTAGAT CCACAAOTOA AAGGTATACT CCTTCAOCAC ATOTOAATTA CTOAACTOAO 3 773 
CTTTTCCTOC TTCTAAAOCA TCAOGGGGTG TTCCT ATTAA CCACTCTCOC CACTCTT0CA 3 33 3

GGTTGCTATC TGCTOTCCCT TATGCAT AAA GTAAAAAGCA A AATGTCAAT GACA7TTOCT 3 193 

T ATTGACAAG GACTTTGTTA TTTOTGTTOG GAOTTOAOAC AATATOCCCC ATTCTAAGT A 3953 

AAAAGATTCA OGTCCACATT OTATTCCTOT TTTAATTOAT TTTTTGATTT OTTTTTCTIT 40 13 

TTCAAAAAOT TTATAATTTT AATTCATGTT AATTTAGTAA T AT AATTTT A CATTTTCCTC 4073 

AAGAATGGAA TAATTTATCA GAAAGCACTT CTT AAGAAAA T ACT TAG C AG TTTCCAAAOA 4133 

AA AT AT AAA A TTACTCTTCT GAAAGOA ATA CTTATTTTTG TCTTCTT ATT TTTOTTATCT 4193 

TATGTTTCTG TTTOTAO A TAT TTO CAO OAA ATA TAT AAT TCA AAT AAT CAA 4244 

Tyr Lev Oil Gin II* Tyr An S*r A*a A«a Oil 

13 5 140 143 

A AG ATT GTT AAC CTG AAA GAG AAG OTA OCC CAO CTT OAA OCA CAO TOC 429 2 

L> i lie Val Asm Lett Ly • Olv Ly* Vil Ala Ola L*« Oil Ala Gla Cy • 
15 0 15 5 16 0 

CAO OAA CCT TOC AAA OAC ACG GTO CAA ATC CAT OAT ATC ACT GOO AAA O 434 1 
Ola Olv Pro Cy. Ly* Atp Thi Vil Gla 11* Hi* Asp II* Tar Oly Ly* 

16 5 170 175 

GTAACTGATG AAGOTTATAT TOOOATT AGO TT CAT CAA AG TAAGTAATOT AAAOGAOAAA 440 1 

GTATGTACTG OA AAGT AT AG OAATAOTTTA OAAAGTGGCT ACCCATTAAG TCTAAOAATT 446 1 

TCAOTTOTCT AOACCTTTCT TOAATAOCIA AAAAAAAC AG TTT AAAAOOA ATOCTGATGT 45 2 1 

OAAAAGTAAO AAAATTATTC TTOOAAAATO AATAOTTT AC T AC AT GTT A A AAOCTATTTT 45 tl 

TCAAOOCTGO CACAOTCTTA CCTOCATTTC AAACCACAGT AA AAGT CO AT TCTCCTTCTC 464 1 

TAG AT TGT CAA OAC ATT OCC AAT AAG OGA OCT AAA CAO AOC GOG CTT 46 SS 

A*p Cy* Ola Aip II* Ala A»a Ly* Oly Ala Ly* Ola S«r Oly Lav 
1 ft 0 l ft 5 190 

TAC TTT ATT AAA CCT CTG AAA OCT AAC C AG CAA TTC TIA OTC TAC TOT 4736 
Tyr Pb« II* Ly* Pro L*« Ly* Ala A*a Ola Ola Pa* L*» Val Tyr Cy* 

195 200 205 

OAA ATC OAT OOG TCT OGA AAT OOA TOG ACT GTO TTT CAO A AO 477* 
Olv lie Aap Oly Ser Oly A*a Oly Trp Tar Val Pit Ola Ly* 



2 1 0 




2 1 5 




2 2 0 






OT AATTTTTT 


CCCCACCATO 


TGTATTTAAT 


AAATTCCTAC 


ATTOTTTCTO 


CCATATOOCA 


4131 


OATACTTTTC 


T A AOC ACCTT 


GTOAACCOTA 


GCTCATTTAA 


TCCTTOCAAT 


AGCCCT A AGA 


4 1 9 S 


OOAAOOT ACT 


TCTOTTACTC 


CTA TTT AC AG 


AAAAOOA AAC 


TOAOOCACAC 


AAGOTTAAAT 


495 » 


AACTTOCCCA 


AOACCACATA 


ACTAATAAOC 


AACAGAGTCA 


OCATTTOAAC 


C T A GO CAOT A 


5 0 1 t 


TAOTTTCAOA 


OTTTOTOACT 


TOACTCTATA 


TTOTACTOOC 


AC T OAC T TTO 


TAOATTCATO 


5 07 t 


OTOOCACATA 


ATCATAOTAC 


CACAOTOACA 


AAT AAAAAOA 


AOGAAACTCT 


TTTOTCAOOT 


5 1 3 » 


AOGTCAAOAC 


CTOAOOTTTC 


CCATCACAAG 


ATGAGOAAGC 


CCAACACCAC 


CCCCCACCAC 


S19I 


CCCACCACCA 


TCACCACCCT 


TTCACACACC 


AGAOOATACA 


CTTOOOCTOC 


TCCAAOACAA 


5 25 1 


GOAACCTOTO 


TTOCATCTOC 


CACTTOCTOA 


TACCCACTAG 


OAATCTTOOC 


TCCTTTACTT 


53 13 


TCTOTTT ACC 


TCCCACCACT 


GTTATAACTO 


TTTCTACAOO 


GGOCOCTCAO 


AGGOAAT OAA 


5 3 7 3 


TOOTOOAAOC 


ATTAOTTOCC 


AGACACCGAT 


TOAOCAATOO 


OTTCCATCAT 


AAOTOT A AG A 


5 43 1 


AT CAOT A AT A 


TCCAOCTAOA 


0TTCT0AA0T 


CGTCTAOOTO 


TCTTTTTA AT 


ATT ACCACTC 


549 3 


ATTTAGAATT 


TATOATGTOC 


CAGAAACCCT 


CTT AAGT AT T 


TCTCTTATAT 


TCTCTCTCAT 


5 5 5 ft 


GATCCTTGC A 


OCAACCCTAA 


OAAOTAACCA 


TCATTTTTCC 


TATTTOATAC 


ATOAOOAAAC 


5 6 1ft 


TGAOOT AOCT 


TGOCCAAGAT 


CACTTAGTTO 


OOAOTTOATA 


OAACCAOTOC 


TCTG T ATTTT 


56 7ft 


TOACAAAATG 


TTGAC AOCAT 


TCTCTTTAC A 


TOCATTOATA 


OTCT ATTTTC 


TCCTTTTOCT 


573 ft 
CTTGCAAATG TOT A ATT AO AO A CTT OAT OOC ACT OTA OAT TIC AAO AAA A AC 57*0

Ai| Leo Asp 01 r Ser V* 1 Aip Pae Lyi Ly ■ Asa 

225 230 

TOO ATT CAA TAT AAA OAA OOA TTT OOA CAT CTO TCT CCT ACT OOC ACA 5*3» 

Trp lie Ola Tyr Lyt Ola Oly Pie Olj Hii Leo Ser Pro Tar Oly Tar 

235 240 245 

ACA OAA TTT TOO CTO OOA AAT OAO AAO ATT CAT TTO ATA AOC ACA CAO 5*16 

Thr Olo Phe Tip Leu Oly Ask OIo Lys lie His Lew lie Sir Tor Ola 

250 255 2 60 265 

TCT OCC ATC CCA TAT OCA T T A AO A 0T0 OAA CTO OAA OAC TOO AAT OOC 593 4 

Set Al* lie Pro Tyr Ala Leo Ai| Val Olo Leo Olo Asp Trp Asa Oly 
270 275 210 

AO A ACC AG GTACTOTTTT OAAATOACTT CCA ACTTTTT ATTOTAAAOA 39*2 
Arg Thr Ser 

TTOCCTGGAA TGTOCACTTT CCAACTATCA ATAOACAATO OCAAATOCAO CCTOACAAAT 6042 

OCAAACAOCA CATCCAOCCA CCATTTTCTC CAOOAOTCTO TTTOOTTCTT OOOCAATCCA 6102 

AAAAOOTAAA TTCT ATTC AO OATOAATCTA AOTOTATTOO TACAATCTAA TTACCCTOOA 6 162 

ACCATTCAOA OTAATAOCTA ATTACTOAAC TTTTAATCAO TCCCAOOAAT TOAOCATAAA 6222 

ATT AT AATTT TATCTAOTCT A AATTA CT AT TTCATGAAOC AOOTATTATT ATTAATCCCA 62S2 

TTTTATAOAT TAACTTOCTC AAAOTCACAT TOCTO AT AAO TOOTAOAOOT AOAATTC AO A 6342 

CTCAAOTAOT TTAACTTTAO AOCCTOTCCT CTTAACAACT ATCCTOOTTO A A AAO C A AAT 6402 

ACAOCCTCTT CAGACTTCTC AGTGCCTTOA TOOCCATTTA TTCTOTCAAA TCATOAOCTA 6462 

CCCTAAAAOT AAACC AOCT A OCTCTT TTO A TOAT CT AO AO OCTTCTTTTT OCTTOAOATA 65 22 

TTTGAAGGTT TTAAOCATTG TTACCTAATT AAAATOCAOA AAAATATCCA ACCCTCTTOT 6512 

TATOTTTAAO OAATAOTOAA ATATATTOTC TTCAAACACA TOOACTTTTT TTTATTOCTT 6642 

OOTTOOTTTT TAATCCAOAA AGTOCTATAO TCAOTAOACC TTCTTCTAOO A A AGO AC CTT 6702 

CCATTTCCCA GCCACTOOAG ATTAOAAAAT A AO CT A A AT A TTTTCTOOAA ATTTCTOTTC 6762 

ATTCATTAAG OCCCATCCTT TCCCCCACTC TATAOAAOTG TTOTCCACTT OCACAATTTT 6122 

TTCCAOOAAA GAATCTCTCT AACTCCTT C A GCTCACATGC TTTGOACCAC ACAOOOAAOA 6112 

CTTTO ATTOT OTAATOCCCT CAOAAOCTCT CCTTCTTOCC ACTACCACAC TOATTTOAOO 6942 

AAOAAAATCC CTTTAOCACC TAACCCTTCA OOTOCTATOA OTOOCTAATO OAACTOTACC 7002 

TCCTTCAAOT T TTOTOC AAT AATTAAOOOT CACTCACTOT CAOATACTTT CTOTOATCTA 7062 

TOATAATGTO TOTOCAACAC ATAACATTTC AATAAAAGTA OAAAATATOA AATTAOAOTC 7122 

AT CT AC ACAT CTOO ATTTO A TCT T AO AAT 0 AAACAAOCAA AAAAOCATCC A AOTO AGTOC 71»2 

AATTATTAOT TTTCAOAOAT OCTTCAAAOO CTTCTAOOCC CATCCCOOOA AOTOTTAATO 7242 

AOCTOTOOAC TOOTTCACAT ATCTATTOCC TCTTGCCAOA TTTOCAAAAA ACTTCACTCA 7302 

ATOAOCAAAT TTCAOCCTTA AOAAACAAAO TCAAAAATTC CAAOOAAOCA TCCTACOAAA 7362 

OAOOOAACTT CTOAOATCCC TOAOOAOOOT CAOCATOTOA TOOTTOTATT TCCTTCTT CT 742 2 

CAG T ACT GCA GAC TAT GCC ATG TTC AAG GTG GGA CCT GAA GCT GAC 7468

Thr Ala Asp Tyr Ala Met Phe Lys Val Gly Pro Glu Ala Asp
285 290 295

AAG TAC CGC CTA ACA TAT GCC TAC TTC GCT GGT GGG GAT GCT GGA GAT 7516
Lys Tyr Arg Leu Thr Tyr Ala Tyr Phe Ala Gly Gly Asp Ala Gly Asp
300 305 310

GCC TTT GAT GGC TTT GAT TTT GGC GAT GAT CCT AGT GAC AAG TTT TTC 7564
Ala Phe Asp Gly Phe Asp Phe Gly Asp Asp Pro Ser Asp Lys Phe Phe

315 320 325 330
ACA TCC CAT AAT GGC ATG CAG TTC AGT ACC TGG GAC AAT GAC AAT GAT 7612

Thr Ser His Asn Gly Met Gln Phe Ser Thr Trp Asp Asn Asp Asn Asp

335 340 345

AAG TTT GAA GGC AAC TGT GCT GAA CAG GAT GGA TCT GGT TGG TGG ATG 7660

Lys Phe Glu Gly Asn Cys Ala Glu Gln Asp Gly Ser Gly Trp Trp Met

350 355 360

AAC AAG TGT CAC GCT GGC CAT CTC AAT GGA GTT TAT TAC CAA G 7703

Asn Lys Cys His Ala Gly His Leu Asn Gly Val Tyr Tyr Gln

365 370 375
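Each coding row in this listing pairs a line of codons with the three-letter amino acids they encode, so a scanned codon line can be cross-checked by translating it with the standard genetic code. The sketch below does this for the row ending at position 7516. It is illustrative only: the codon string assumes that the scanner's letter O stands for G, and `translate`, `CODON_TABLE` and `THREE_LETTER` are hypothetical names introduced for the example.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
BASES = "TCAG"
AMINO_1 = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_1)
}

# One-letter to three-letter codes; "Ter" marks a stop codon.
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "Ter",
}

def translate(codon_line: str) -> list[str]:
    """Translate space-separated codons into three-letter residue names."""
    return [THREE_LETTER[CODON_TABLE[codon]] for codon in codon_line.split()]

# Assumed reading of the scanned row (O read as G), with its residue line.
DNA_LINE = "AAG TAC CGC CTA ACA TAT GCC TAC TTC GCT GGT GGG GAT GCT GGA GAT"
PROTEIN_LINE = "Lys Tyr Arg Leu Thr Tyr Ala Tyr Phe Ala Gly Gly Asp Ala Gly Asp"

print(translate(DNA_LINE) == PROTEIN_LINE.split())
```

A mismatch between the translated codons and the printed residue line would point to a scanning error in one of the two lines; agreement supports the reading but does not prove it, because synonymous codons translate identically.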






OTATOTTTTC 


CTTTCTTAOA 


TTCCAAOTT A 


ATOTATAOTO 


TATACTATTT 


TC ATA A AAA A 


7763 


TAATAAATAG 


ATATOAAOAA 


ATOAAGAATA 


ATTTATAAAO 


ATAOTAOOOA 


TTTTATCATO 


7 12 3 


TTCTTT ATTT 


C AACTAAOTT 


CTTTOAAACT 


OOAAOTOOAT 


AAT ACC A AOT 


TCATOCCT AA 


711) 


AATT AOCCCT 


TCT AAAOAAA 


TCCACCTOCT 


OCAAAATATC 


CAOTAOTTTO 


OCATTATATO 


7 9 43 


TO AAACTATC 


ACC ATCATAO 


CTOOCACTOT 


OOOTTOTOOO 


ATCTCCTTT A 


OAC AT AC AAC 


1003 


AT AAATOATC 


TOOATOOATT 


AACATTACTA 


CATOOATOCT 


TOTTOACACA 


TTAACCTOOC 


t 06 3 


TTCCCATOAO 


CTTTOTOTCA 


OATACACOCA 


OTOAACAOOT 


OTTTOOAOOA 


ACAOAATAAA 


IMS 


OAOAAOOCAA 


OCACTOOT AA 


OOOCAOOOOT 


T TO TO A A AOC 


T TO AO AO A AO 


AOACCAOTCT 


1 I 1 3 


OAOOACAOT A 


OAC ACTTATT 


TTAOOATOOO 


OOT TOO AT OA 


OOAOOCTATA 


OTT TOC TATA 


t 243 


AOCTTGOAAT 


OOTTTOOAAC 


ACTGGTTTCA 


CTCACCTACC 


CAOCAOTTAT 


OTOTOOOOAA 


t 3 03 


OCCTTACCOA 


TOC TAAAOOA 


TCCATOTTAC 


AATAATGGCA 


TTATTTOOAA 


ATCCCAOTOO 


13 63 


T ATT CCATOA 


AT AAAACCAC 


TATOAAOATA 


ATCCCACTCA 


ACAOACTCTC 


COTTOOAOAA 


• 423 


OOACAOCAAC 


ACCACCCTOO 


OAAAOCCAAA 


CAOTCAOACC 


AOACCTOTTT 


AOCATC AOT A 


1413 


OOACTTCCCT 


ACCATATCTO 


CTOOOTAOAT 


OAOTOAAACC 


AO TOT TCC AA 


ACCACTCCOO 


15 43 


OCTT GT AOCA 


AACC AT AOT C 


TCCTCATCT A 


CCAAOATOAO 


CAACCTTACC 


TCCTOATOTC 


16 03 


CT AOCCAATC 


ACCAACTAOO 


AAACTTTOCA 


CAGTTTATTT 


AAAOT A ACAO 


TTTOATTTTC 


1663 


AC AAT ATTTT 


TAAATTOOAO 


AAACATAACT 


TATCTTTOCA 


CTCACAAACC 


A CAT A A TO AO 


• 7 23 


AAOAAACTCT 


AAOOOAAAAT 


OCTTOATCTO 


TOTOACCCOO 


OOCOCCATOC 


CAOAOCTOTA 


• 7 13 


OTTCATOCCA 


OTOTTOTOCT 


CTOACAAOCC 


TTTTACAOAA 


TT AC ATO AO A 


TCTOCTTCCC 


• 143 


TAOOACAAOO 


AOAAOOCAAA 


TCAACAOAOO 


CTOCACTTTA 


AAATOOAOAC 


ATAAAATAAC 


• 9 03 


ATOCCAOAAC 


CATTTCCTAA 


AOCTCCTCAA 


TCAACCAAC A 


AAATTOTOCT 


TTCAAATAAC 


• 963 


CTOAOTTOAC 


CTC ATCAOOA 


ATTTTOTOOC 


TCCTTCTCTT 


CTAACCTOCC 


TO A AO A AAO A 


9023 


TOOTCCACAO 


CAOCTOAOTC 


COOOATOOAT 


AAOCTTAOOO 


ACAOAOOCCA 


ATTAOOOAAC 


90 S3 


TTTOOOTTTC 


TAOCCCTACT 


AOTAOTOAAT 


AAATTTAAAO 


TOTOOATOTO 


ACTATOAOTC 


9 1 43 


ACAGCACAOA 


TOTTOTTTAA 


TAATATOTTT 


ATTTTAT AAA 


TTOATATTTT 


AOOAATCTTT 


9203 


OOAOATATTT 


TCAOTT AOC A 


OATAATACTA 


TAAATTTT AT 


OTAACTOOCA 


ATOCACTTCO 


9263 


T AAT AO AC AO 


CTCTTCATAO 


ACTTOCAOAO 


OT AAAAAOAT 


TCCAOAATAA 


TOATATOTAC 


93 23 


ATCTACOACT 


TOTTTTAO OT OOC ACT TAC TCA AAA 


OCA TCT ACT CCT AAT 


9 3 7 3 




Ol 


y Oly Tkt Tyr Ssr Lyi 


Ala St r Thr 


Fro Aim 





3 10 3 15 

GGT TAT GAT AAT GGC ATT ATT TGG GCC ACT TGG AAA ACC CGG TGG TAT 9421

Gly Tyr Asp Asn Gly Ile Ile Trp Ala Thr Trp Lys Thr Arg Trp Tyr

390 395 400

TCC ATG AAG AAA ACC ACT ATG AAG ATA ATC CCA TTC AAC AGA CTC ACA 9469

Ser Met Lys Lys Thr Thr Met Lys Ile Ile Pro Phe Asn Arg Leu Thr

405 410 415

ATT GGA GAA GGA CAG CAA CAC CAC CTG GGG GGA GCC AAA CAG GTC AGA 9517
Ile Gly Glu Gly Gln Gln His His Leu Gly Gly Ala Lys Gln Val Arg
420 425 430 435

CCA GAG CAC CCT GCG GAA ACA GAA TAT GAC TCA CTT TAC CCT GAG GAT 9565

Pro Glu His Pro Ala Glu Thr Glu Tyr Asp Ser Leu Tyr Pro Glu Asp
440 445 450

OAT TTO TAOAAAATTA ACTOCTAACT TCTATTOACC C AC AAAOTTT C AO A A ATTCT 962 
Asp Lev 

CTOAAAOTTT CTTCCTTTTT TCTCTT ACT A TATTTATTOA TTTC AAOTCT TCTATTAAOO 96* 

ACATTTAOCC TTCAATOOAA ATT AAA AC TC ATTTAOOACT OTATTTCCAA ATT AC TOAT A 974 

TCAOAGTTAT TT A AAAATTO TTTATTTOAO OAOATAACAT TTC AACTTTO TTCCTAAATA 9*0 

TATAATAATA A AATOATTOA CTTT ATTTOC ATTTTT ATGA CCACTTOTCA TTTATTTTGT 9*6 

CTTCOTAAAT TATTTTCATT ATATCAAATA TTTT AGTATO TACTTAATAA AATAOOAOAA 992 

CATTTTAOAO TTTCAAATTC CCAOOTATTT TCCTTOTTTA TTACCCCTAA ATCATTCCTA 99* 

TTTAATTCTT CTTTTTAAAT OOAOAAAATT ATOT CTTTTT AATATOOTTT TTOTTTTOTT 1004 

ATATATTCAC AOOCTOOAOA CCTTTAAAAO ACCOTTTCAA AAO AOATTT A CTTTTTTAAA 10 10 

GOACTTT ATC TOAACAOAOA OATATAATAT TTTTCCTATT OOACAATOOA CTTOCAAAOC 10 16 

TTCACTTCAT TTTAAGAGCA AAAGACCCCA TGTTGAAAAC TCCATAACAO TTTTATOCTO 1022 

ATOATAATTT ATCTACATGC ATTTCAATAA ACCTTTTGTT TCCTAAOACT AGATACATOG 102* 

TACCTTTATT GACCATTAAA AAACCACCAC TTTTTOCCAA TTTACCAATT AC AATTOOGC 1034 

AACCATCAGT AOTAATTOAG TCCTCATTTT ATGCTAAATG TTATOCCTAA CTCTTTOGGA 1040 

GTTACAAAGG AAATAOCAAT TATGGCTTTT GCCCTCTAGG AO AT AC AGO A CAAATACAOG 1046 

AAAATACAGC AACCCAAACT GACAATACTC TATACAAGAA CATAATCACT AAOCAOOAOT 1052 

CACAOCCACA CAACCAAGAT GCATAGTATC CAAAOTGCAO CTO 10564 

(2) INFORMATION FOR SEQ ID NO:6:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 453 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:

M* 1 Ser Tip S • r Lit Hi 1 Pro Arg A » * Leo lie Leo Tyr Pbe Tyr Ala 
1 5 10 15 

Lev Leo Pbe Lao S*r Sat Tbr Cys Val Ala Tyr Val Ala Tbr Art Asp 

2 0 2 5 3 0 

Asa C y 1 Cy 1 Ila L * a Asp Ola At| Pas O I y Sar Tyr Cys Pro Tbr Tbr 
3 5 4 0 4 5 

Cys Gly Ma Ala Asp Pba Laa Sar Tbr Tyr Gla Tbr Lys Val Asp Lys 

5 0 5 5 6 0 

Asp Lao Ola Sot Las Ola Asp 11a Lav Hi 1 Ola Val Ola Asa Lys Tbr 

65 70 75 SO 

Sar Ola Val Lys Gla Laa Ila Lys Ala Ila Gla Lao Tbr Tyr Asa Pro 
t 5 9 0 9 5 

Asp Ol o Ser S • r Lys Pre Asa Met lie Asp Ala Ala Tbr Lev Lys S e r 
10 0 10 5 110 

Art Il« Mat Lao Gin Gla lie Mai Lys Tyr Glo Ala Sar Ila Lao Tbr 

115 12 0 125 

His Asp Ser Ser lie Art Tyr Lao Ola Glo lie Tyr Asa Sar Asa Asa 

13 0 13 5 14 0 
Ola Ly. !!• Val At* L.a Ly . Glo Ly. Val AU Oil L*« Ola Ala OH



143 150 



3 0 5 



155 1«0 



Cy» Oil OlB Pro Cy» Lyi A.p Thi Val Oil II* Hii Aip II* Thi Oly 

165 170 175 

Lye A.p Cyi Ola A.p II* Alt Asa Ly. Oly Al* Lyi Ola S*r Oly L*a 

1«0 115 1 90 

Tyr Pb. lie Lyi Pro Lea Ly. Ala At* Ola Ola Pa* L*a Val Tyi Cy * 

195 200 205 

Ola lie A.p Oly Sex Oly A. a Oly Trp Thi Val Pb* Ola Ly» Art L*» 

2 10 2 15 210 

A.p Oly S*r Val A.p Ph* Lyi Ly. A.a Trp II* Ola Tyr Ly« Ola Oly 



225 230 



235 340 



Ph* Oly Hii L*a 5«r Pr* Th r Oly Tar Tar Ola Ph* Trp L«a Oly Alt 

245 250 255 

Ola Lyi II* Hit L*v II* Ser Tax Ola Ser Ala II* Pro Tyr Ala L*a 

260 265 270 

Arg Val Ola L • a Ola A.p Trp A.a Oly Art Th r S*r Thr Al* Aip Tyr 

275 210 2»5 

Al* Met Phe Lyi Val Oly Pro Ola Al* A.p Lyi Tyr Arg L*a Thr Tyr 

290 295 300 

Al* Tyr Ph* Ala Oly Oly Aip Al* Oly Aip Al* Pha A.p Oly Ph* Atp 



3 15 320 



Phe Oly Aip Aip Pro Ser Aip Lyi Phe Ph* Thr Ser His All Oly M* t 

325 330 335 

Ola Phe S*r Thr Trp Aip An A.p An Atp Lyi Ph* Ola Oly A.a Cyi 

340 345 350 

At* Ola Ola Aip Oly S*r Oly Trp Trp M*t Ai* Ly* Cyi Hii Al* Oly 



355 



3 65 



Hii Lea Aia Oly Val Tyr Tyr Oil Oly Oly Thr Tyr Ser Lyi Al* S*r 

370 375 3*0 

Thi Pro A.a Oly Tyr A.p A.» Oly lie lie Trp Ala Thr Trp Lyi Thr 
3S5 390 395 400 

Arg Trp Tyr S*r Met Lyi Lyi Thr Thr M* t Ly. II* II* Pro Phe A. a 
40 5 4 10 4 15 

Arg Lea Thr lie Oly Ola Oly Ol* Oil Hit Hi. L*a Oly Oly Al* Lyi 
420 425 430 

Ol* Val Arg Pro Ola Hi. Pro Al* Ola Thr Ola Tyr Alp S*r Leo Tyr 
435 440 445 



Pro Ola A I 
4 5 0 



(2) INFORMATION FOR SEQ ID NO:7:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10807 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(vii) IMMEDIATE SOURCE:

(B) CLONE: <mae b*»- bao*k*x&a

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:

ACOCOTOTCO ACCTOCAOOT CAACOOATCT CTOTOTCTOT TTTCATOTT A OTACCACACT 60 

OTTTTOOTOO CTOT AOCTT T CAOCTACAOT CTOAAOTCAT AAAOCCTOOT ACCTCCAOCT 130 



CTOTTCTCTC TCAAOATTOT OTTCTOCTOT TTOOOTCTTT AOTOTCTCCA CACAATTTTT 



ISO 
AG A AT TOTTT GTTCT AOTTC TGTOAA A A A T OATOCTOOTA TTTTOATAAG OATTOCATTO 240

AAT CTOT AAA OCTACAOATA T AOTC AT TOO OTAOTACAOT CACTTTAACA ATATT AACTC 3 00 

TTCACATCTO TOAOCATOAT ATATTTTCCC CCTCTATATC ATCTTCAATT CCTCCTATCA 360 

OTTTCTTTCA TTOCAOTTTT CTOAOTACAO GTCTTACACC TCCTTOOTTA OAGTCATTCC 420 

TCAGTATTTT ATTCCTTTGA TACAATTGTO AATGAOOTAA TTTTCTTAGT TTCTCTTTCT 480 

GATAGCTCAT TGTTAGTOTA TATATAOAAA AGCAACAGAT TTCTATGTAT TAATTTTGTA 540 

TCCTGCAACA GATTTCTATG TATTAATTTT GTATCCTOCT ACTTTACGGA ATTCACTTAT 600 

TAGCTTTTTG GTGACATCTT GAOGATTTTC TGAAGAAAAT OGCATOGTAT GGTAGOACAA 660 

OGTGTCATGT CATCTOCAAA CAGTGGCAGT TTTCCTTCTT CCCTTCCAAC CTGOATTTCT 720 

TTGATTTCTT TCTOTCTGAO TACGACTAGG ATTCCCAATA CTATACCOAA TAAAAGTOGC 7 SO 

A AGAGTGGAC ATCCTTOTCT TATTTTTCTG ACCTTAGAOG A A ATGCTTTC AOTTTTTCAC S40 

CATTAATTAT AATGTTT ACT GTGGGCTTGT CATATGTOOC CTTCATTATA TOOAGOTCTA 900 

TTCCCTCTAT ACCCACCTTG TTOAG AGTTT TTATCATAAA AGTATOTTGA ATTTTGT CAA 960 

AAGTTTTTCC TGCATCTATT OAGATGATTT TTACTCTTCA ATTCATTAAT GATTTTTATT 1020 

CTTCATTTTG TTAATGATTT CCATTCTTCA ATTTGTTAAC GTOGTATATC ACATTGATTG 1080 

ATTTGTGGAT ACCTTTGTAT CCCTOGGATA AACCTCACTT OATCATGAGC TTTCAATOTA 1140 

TTTTTOAATT CACTTTGCTA ATATT CTCTT GGGTATTTTT OCATCTCTAT TCATCAATGA 1200 

TATTGOCCTA AGAAAGGTTT TGTCTGGTTT TAGTATCAOG GTOATGCTOO CCTCATAOAO 1260 

AGAGTTIAGA AOCATTTCCT CCTCTTTGAT TTTTCGGAAT AGTTTGAGTA OGATAGGTAT 13 20 

TAACTCTTCT TTAAATGTTT OGGGACTTCC CTGOTGAOCC GGTGOTTOAG AATCCGCCTC 13 »0 

AGOGATOTOG OTTTOATCCC TGGTCAOGOA ACCATTAATA AGATCCCACA TGCTOCAOGC 1440 

AACAAOCCCC CAAGCTOCAA CCACTGAGCT OCAACCOCTO CAOTOCCCAC AOOCCACOAC 1500 

CAGAGAAAGC CCACATACAG CAGGGAAGAC CCAOCACAAC COGAAAAAOG AGTTTGGTOG 1560 

AATACAGCTO TO A AGCC GT C TOG TC CTOG A CTCCTOCTTG AOOOAATTTT TTAAAAATTA 1620 

TTGATTCAAT TTCATTACTG OTAACTOGTC TGTTCATATT TTCTATTTCT TCCOGOTTCA 16*0 

OTCTTOOGAO ATTOTACATG CCTAOGAATG TOTCCGTTTC TTCTAOOTTG TCCATTTTAT 1740 

TOGACATGCA TOOOAOCACA CAGCACCOAC CAGCGAGACT CATGCTGOCT TCCTOOOGCC 1SO0 

AOGCTOGOOC CCCAAOCAOC ATOOCATCCT AGAOTOTGTG AAAGCCCACT GACCCTOCCC 1S60 

AOCCCCACAA TTTCATTCTG AGAAOTOATT CCTTOCTTCT GCACTTACAG OCCCAOOATC 1920 

TOACCTOCTT CTOAOOAGCA OOGGTTTTOO CAOOACGQGG AGATGCTOAG AGCCGACOGG 1980 

OOTCCAOGTC CCCTCCCAOO CCCCCCTOTC TOOOOCAGCC CTTGGGAAAG ATTOCCCCAG 2040 

TCTCCCTCCT ACAOTOOTCA GTCCCAOCTO CCCCAOOCCA OAGCTOCTTT ATTTCCGTCT 2100 

CTCTCTCTOO ATOGT AT TC T CTGOAAO CTG AAOOTTCCTG AAOTTATOAA TAOCTTTOCC 2160 

CTGAAOOGCA TGGTTTGTGO TCACOGTTCA CAOOAACTTG OGAGACCCTO CAGCTCAOAC 2220 

OTCCCGAGAT TGGTOOCACC C AG AT TTC CT AAGCTCGCTG GGOAACAOOG COCTTOTTTC 22 8 0 

TCCCTOOCTO ACCTCCCTCC TCCCTOCATC ACCCAOTTCT GAAAGCAOAG CGGTOCTOOG 2340 

OTCACAOCCT CTCGCATCTA ACOCCOOTOT CCAAACCACC COTOCTOOTO TTCOOOOOOC 2400 

TACCTATGOG OAAGGOCTTC TCACTOCAGT OGTOCCCCCC OTCCCCTCTG AGATCAOAAG 2460 

TCCCAOTCCO OACOTCAAAC AOGCCGAGCT CCCTCCAGAG OCTCCAGOOA OOOATCCTTO 25 20 

CCCCCCCGCT OCTOCCTCCA OCTCCTOOTG CCOCACCCTT GAOCCTGATC TTOTAGACOC 25S0 
CTCAOTCTAO TCTCTOCCTC COTOTTCACA COCCTTCTCC CCATOTCCCC TCCOTOTCCC 2640

COTTTTCTCT CACAAGGACA CCOOACATTA OATTAOCCCC TOTTCCAOCC TCACCTOAAC 2700 

AOCTCACATC TOT A A AO AC C TAOATTCCAA AC AAOATTCC AACCTOAAOT TCCCOOTOOA 2760 

TGTOAOTTCT OOOOCOACAT CCTTCAACCC CATCACAOCT TOCAOTTCAT COCAAAACAT 2*20 

OOAACCTOOO OTTTATCOTA AAACCCAOCT TCTTCATOAA ACACTOAOCT TCOAOOCTTO 2880

TTOCAAOAAT TAAAOOTOCT AATACAOATC AOOOCAAOOA CTOAAOCTOO CTAAOCCTCC 2940

TCTTTCCATC ACAOOAAAOO OOOOCCTOOQ OOCOOCTOOA OOTCTOCTCC COTOAOTOAG 3000

CTCTTTCCTO CTACAOTCAC CAACAOTCTC TCTOOOAAOG AAACCAGAGG CCAOAOAOCA 3060

AGCCOGAOCT AGTTTAOOAO ACCCCTGAAC CTCCACCCAA OATOCTOACC AOCCAOCOOO 3120

CCCCCTOOAA AGACCCTACA GTTCAOOOOG OAAGAOGOOC TOACCCGCCA OOTCCCTOCT 3180

ATCAOGAGAC ATCCCCGCTA TCAOOAOATT CCCCCACCTT GCTCCCGTTC CCCTATCCCA 3240

ATACOCCCAC CCCACCCCTG TGATOAOCAG TTTAOTCACT TAGAATOTCA ACTGAAOOCT 3300

TTTOCATCCC CTTTGCCAGA GOCACAAOGC ACCCACAGCC TOCTOGOTAC COACOCCCAT 3360

GTGGATTCAO CCAGOAOOCC TGTCCTOCAC CCTCCCTOCT CGGGCCCCCT CTOTGCTCAG 3420

CAACACACCC AGCACCAOCA TTCCCOCTGC TCCTOAOOTC TOCAOOCAOC TCOCTOTAGC 3480

CTGAOCGGTO TGOAGGGAAG TOTCCTOOOA GATTTAAAAT GTOAGAGOCO OOAOOTOGGA 3540

GOTTOGOCCC TOTOOOCCTO CCCATCCCAC OTGCCTGCAT TAOCCCCAOT OCTGCTCAOC 3600

COTOCCCCCO CCOCAOGOGT CAOGTCACTT TCCCOTCCTG OOGTTATTAT OACTCTTOTC 3660

ATTOCCATTO CCATTTTTGC TACCCTAACT OOGCAGCAGO TGCTTOCAGA OCCCTCGATA 3720

CCGACCAGGT CCTCCCTCGG AOCTCGACCT OAACCCCATO TCACCCTTOC CCCAOCCTOC 3780

AOAOOOTOOO TGACTGCAOA OATCCCTTCA CCCAAOOCCA CGOTCACATO OTTTOOAOOA 3840

GCTOGTOCCC AAOOCAOAOO CCACCCTCCA OGACACACCT OTCCCCAGTG CTGGCTCTOA 3900

CCTGTCCTTO TCTAAGAGOC TOACCCCOOA AOTGTTCCTG OCACTOOCAG CCAOCCTOOA 3960

CCCAGAGTCC AOACACCCAC CTGTGCCCCC GCTTCTGOGG TCTACCAOGA ACCOTCTAOO 4020

CCCAGAGGOO ACTTCCTOCT TOGCCTTOOA TOGAAOAAOG CCTCCTATTG TCCTCOTAGA 4080

OOAAOCCACC CCOOOOCCTO AOGATOAOCC AAOTOOGATT CCOOOAACCO COTOOCTOOO 4140

OGCCCAGCCC GOOCTOOCTO OCCTOCATGC CTCCTGTATA AOGCCCCAAO CCTOCTOTCT 4200

CAOCCCTCCA CTCCCTOCAO AGCTCAOAAO CACOACCCCA OOGATATCCC TOCAOCCATO 4260

AAGTOCCTCC TOCTTOCCCT OGOCCTOOCC CTCOCCTOTG OCGTCCAOOC CATCATCGTC 4320

ACCCAOACCA TGAAAOGCCT GOACATCCAO AAOOTTCGAG OOTTGOCCGG GTOOGTOAOT 4380

TOCAOOOCOO OCAGOOGAGC TOOOCCTCAO AOAOCCAAOA GAOGCTGTOA COTTOGOTTC 4440

CCATCAOTCA GCTAGOOCCA CCTGACAAAT CCCCOCTOOO OCAOCTTCAA CCAOOCGTTC 4500

ACTOTCTTGC ATTCTOGAGO CTOOAAOCCC AAOATCCAGO TOTTOOCAOO OCTOOCTTCT 4560

CCTOCGOCCO CTCTCTGOOO AOCAOACOOC COTCTTCTCC AOTCCTCTOC OCOCCCTOAT 4620

TTCCTCTTCC TOTOAOOCCA CCAOGCCTGC TOOAAACACO CCTGCCTGCO CAOCTTCACA 4680

COACCTTTOT CATCTCTTTA AAGOCCATGT CTCCAGAOTC ATGTGTTGAA OTTCTGOOGO 4740

TTAOTOOOAC ACAGTTCAOC CCCTAAAAGA OTCTCTCTGC CCCTCAAATT TTCCCCACCT 4800

CCAGCCATGT CTCCCCAAGA TCCAAATOTT OCTACATOTO OGOOOOCTCA TCTOOOTCCC 4860

TCTTTOGOTT CAGTGTGAOT CTOOOOAOAO CATTCCCCAO OGTOCAGAOT TGOOGOOAGT 4920

ATCTCAOOOC TOCCCAOOCC OOGOTGGOAC AOAGAGCCCA CTOTGOOOCT QQGOOCCCCT 4980
TCCCACCCCC AGAOTGCAAC TCAAGOTCCC TCTCCAOGTG OCOGGGACTT GGCACTCCTT 5040

OOCTATOOCO OCCAGCGACA TCTCCCTOCT GGATOCCCAG AOTGCCCCCC TGAGAGTGTA 5100

COTOOAOOAO CTGAAGCCCA CCCCCOAOGO CAACCTOOAO ATCCTOCTGC AGAAATGGTG 5160

OOCOTCTCTC CCCAACATOG AACCCCCACT CCCCAGGOCT GTGOACCCCC COGGGGGTOG 5220

OOTOCAOOAO OOACCAOOOC CCCAGGGCTG OGGAAGAOGG CTCAGAGTTT ACTOGTACCC 5280

OGCGCTCCAC CCAAGGCTOC CCACCCAGOG CTTTTTTTTT TTTTAAACTT TTATTAATTT 5340

OATGCTTCAG AACATCATCA AACAAATOAA CATAAAACAT TCATTTTTOT TTACTTOGAA 5400

OGGGAOATAA AATCCTCTGA AOTOGAAATO CATAGCAAAG ATACATACAA TOAOOCAOGT 5460

ATTCTGAATT CCCTOTTAGT CTGAGGATTA CAAGTOTATT TGAGCAACAG AGAGACATTT 5520

TCATCATTTC TAOTCTOAAC ACCTCAOTAT CTAAAATGAA CAAGAAGTCC TGOAAACOAA 5580

GCAGTGTOGG GATAGGCCCG TOTOAAOGCT GCTGOOAGGC AGCAGACCTG GGTCTTCGGG 5640

CTCAAGCAGT TCCCGCTACC AOCCCTGTCC ACCTCAOACG OGGOTCAGOG TGCAGGAGAG 5700

AGCTOOATGO GTGTGOGGOC AGAGATGGGO ACCTOAACCC CAGGGCTGCC TTTTGOOOGT 5760

GCCTGTGOTC AAGGCTCTCC CTGACCTTTT CTCTCTGOCT TCATCTGACT TCTCCTGGCC 5820

CATCCACCCG GTCCCCTGTO GCCTOAGGTO ACAOTGAGTO COCCGAGGCT AOTTGOCCAO 5880

CTOGCTCCTA TOCCCATGCC ACCCCCCTCC AOCCCTCCTG OGCCAOCTTC TOCCCCTOGC 5940

CCTCAGTTCA TCCTGATGAA AATOGTCCAT OCCAATOGCT CAGAAAGCAG CTGTCTTTCA 6000

GOGAGAACGG COAOTGTOCT CAOAAGAAGA TTATTOCAGA AAAAACCAAG ATCCCTOCGO 6060

TOTTCAAGAT COATGGTOAG TCCGOOTCCC TGGGOGACAC CCACCACCCC CGCCCCCGOG 6120

OACTOTGGAC AGGTTCAGOG GOCTGOCOTC GGOCCCTOGG ATOCTAAOGO ACTGGTGGTG 6180

ATOAAGACAC TGCCTTOACA CCTGCTTCAC TTOCCTCCCC TGCCACCTGC CCOOOOCCTT 6240

OOOOCGOTOO CCATGGOCAO OTCCCOGCTO GCGGOCTAAC CCACCAGOOT GACACCCGAG 6300

CTCTCTTTOC TGOGGGOCOG OCOOTGCTCT OOGCCCTCAG GCTGAGCTCA OGAGOTACCT 6360

GTOCCCTCCC AOOOOTAACC OAOAGCCGTT OCCCACTCCA GGGGCCCAGO TOCCCCACGA 6420

CCCCAOCCCO CTCCACAOCT CCTTCATCTC CTOGAGACAA ACTCTOTCCG CCCTCOCTCA 6480

TTCACTTOTT COTCCTAAAT CCOAOATOAT AAAOCTTCGA OGOGOOOTTO OGGTTCCATC 6540

AGGOCTOCCC TTCCOCCOOO CAGCCTOOGC CACATCTOCC CTTGGCCCCC TCAOGACTCA 6600

CTCTOACTOO AGOCCCTOCA CTOACTOACO CCAOOOTOCC CAGCCCAGOG TCTCTOOCOC 6660

CATCCAOCTG CACTOOOTTT 0G0TOCTOOT CCTOCCCCCA AOCTOCCCGG ACACCACAOG 6720

CAOCCOOOOC TOCCCACTOO CCTCOGTCAO OGTOAOCCCC AGCTGCCCCC OCTCAOOOCT 6780

TOCCCCGACA ATOACCCCAT CCTCAOOACG CACCCCCCTT CCCTTOCTOG GCAGTOTCCA 6840

OCCCCACCCO AOATCOGOOG AAOCCCTATT TCTTOACAAC TCCAOTCCCT GOGGGAGOGG 6900

OCCTCAOACT OAOTOOTGAO TOTTCCCAAO TCCAGOAOOT GGTOGAGGGT CCTGOCOOAT 6960

CCAOAOTTOA CAOTOAOGOC TTCCTGOOCC CCATOCGCCT OGCAGTGOCA GCAGGOAAGA 7020

GGAAGCACCA TTTCAOOGGT OOOOGATOCC AOAOOCGCTC CCCACCCCGT CTTCOCCOOO 7080

TOOTGACCCC QGOOGAOCCC COCTOOTCOT OOAGGGTOCT GOGOOCTGAC TAGCAACCCC 7140

TCCCCCCCCO TTOGAACTCA CTTTTCTCCC OTCTTOACCO COTCCAOCCT TGAATGAGAA 7200

CAAAOTCCTT OTOCTOGACA CCOACTACAA AAAGTACCTG CTCTTCTGCA TGGAAAACAG 7260

TOCTOAOCCC GAGCAAAOCC TOOCCTGCCA GTOCCTOOOT GGOTGCCAAC CCTOOCTGCC 7320

CAOOOAOACC AOCTGCOTOG TCCTTOCTGC AACAOGOGOT GGGGOGTGGG AOCTTGATCC 7380
CCAOOAOOAO OAOOOOTOOO OOOTCCCTOA OTCCCOCCAO OAOAOAOTOO TCOCATACCO 7440

OOAOCCAOTC TOCTOTOOOC CTOTOOOTOO CTOOOOACOO GOGCCAGACA CACAOOCCOO 7500

OAOACOOOTO OOCTOCAOAA CTOIOACTOO TOTOACCOTC OCOATOOOOC COOTOOICAC 7560

TOAATCTAAC AGCCTTTOTT ACCOOOOAOT TTCAATTATT TCCCAAAATA AOAACTCAOG 7620

TACAAAOCCA TCTTTCAACT ATCACATCCT GAAAACAAAT GOCAOOTOAC ATTTTCTGTG 7680

CCOTAOCAGT CCCACTGGGC ATTTTCAOOO CCCCTOTGCC AOGOOGGCGC OOOCATCOOC 7740

OAOTOOAOGC TCCTGGCTOT OTCAGCCOGC CCAOGOOOAO OAAOOOACCC GOACAOCCAO 7800

AGGTOGGGGG CAGGCTTTCC CCCTOTOACC TOCAOACCCA CTOCACTGCC CTGGOAGGAA 7860

OGOAGOGGAA CTAOOCCAAO OOOOAAOOOC AGGTGCTCTG OAGGOCAAOG OCAOACCTOC 7920

AGACCACCCT OGGOAGCAGG QACTOACCCC CGTCCCTOCC CCATAGTCAG OACCCCOOAO 7980

GTOGACAACO AGGCCCTGGA GAAATTCOAC AAAOCCCTCA AOOCCCTOCC CATOCACATC 8040

CGGCTTGCCT TCAACCCGAC CCAOCTOOAO GOTOAOCACC CAOOCCCCOC CCTTCCCCAO 8100

GOCAOGAGCC ACCCOGCCCC OGGACOACCT CCTCCCATOO TOACCCCCAG CTCCCCAOOC 8160

CTCCCAGOAG GAAGGGGTOG OGTOCAGCAC CCCOTGGGOO CCCCCTCCCC ACCCCCTOCC 8220

AOGCCTCTCT TCCCOAOGTO TCCAGTCCCA TCCTOACCCC CCCATGACTC TCCCTCCCCC 8280

ACAGGGCAGT OCCACOTCTA OGTOAOCCCC TOCCOOTGCC TCTOOGGTAA OCTGCCTOCC 8340

CTGCCCCACG TCCTOGOCAC ACACATGOOG TAOOOOOTCT TOOTOGOGCC TOOOACCCCA 8400

CATCAOOCCC TGGOGTCCCC CCTGTOAOAA TGOCTOOAAO CTGGOOTCCC TCCTGOCOAC 8460

TGCAGAOCTG OCTOOCCOCG TOCCACTCTT GTOGOTGACC TGTGTCCTOO CCTCACACAC 8520

TGACCTCCTC CAGCTCCTTC CAGCAGAOCT AAOOCTAAOT OAOCCAOAAT OGTACCTAAG 8580

OGGAOOCTAG COGTCCTTCT CCCGAGOAOO OOCTOTCCTG GAACCACCAO CCATOOAGAG 8640

GCTGOCAAOG GTCTOGCAOO TOCCCCAOOA ATCACAOOGO OOCCCCATOT CCATTTCAOO 8700

GCCCGGOAOC CTTGOACTCC TCTOOOOACA OACOACOTCA CCACCGCCCC CCCCCCATCA 8760

0OGOOACTAG AAOOOACCAG OACTOCAOTC ACCCTTCCTO GGACCCAOGC CCCTCCAGOC 8820

CCCTCCTOOG GCTCCTOCTC TOOGCAOCTT CTCCTTCACC AATAAAGGCA TAAACCTGTG 8880

CTCTCCCTTC TGAOTCTTTG CTOOACOACO OOCAOOGOOT OGAGAAGTOG TOOOOAOOOA 8940

OTCTGOCTCA OAOOATOACA OCGGOOCTOO OATCCAOOOC OTCTOCATCA CAOTCTTOTO 9000

ACAACTOOOO GCCCACACAC ATCACTOCOO CTCTTTOAAA CTTTCAGOAA CCAOOOAOOO 9060

ACTCOOCAGA OACATCTOCC AOTTCACTTO OAOTOTTCAO TCAACACCCA AACTCOACAA 9120

AGOACAGAAA OTOGAAAATO OCTOTCTCTT AOTCTAATAA ATATTOATAT OAAACTCAAO 9180

TTOCTCATOO ATCAATATOC CTTTATOATC CAOCCAOCCA CTACTOTCGT ATCAACTCAT 9240

OTACCCAAAC GCACTOATCT OTCTOOCTAA TOATOAGAOA TTCCCAGTAG AGAGCTGOCA 9300

AGAOGTCACA OTGAOAACTO TCTOCACACA CAOCAOAOTC CACCAGTCAT CCTAAGGAGA 9360

TCAGTCCTGO TOTTCATTOO AGGACTGATG TTOAAOCTOA AACTCCAATO CTTTOGCCAC 9420

CTOATGTOAA GAOCTOACTC ATTTOAAAAG ACCCTOATOC TGOGAAAOAT TOAOGOCAGO 9480

AOGAOAAGGO GACGACAGAG OATOAOATOO TTOOATOOCA TCACCAACAC AATGOACATG 9540

OOTTTGOOTO OACTCCAGOA OTTOOTOATO GACAOOGAOG CCTGOCGTGC TACGOAAOCO 9600

OTTTATOOOO TCACAAAGAC TOAOTOACTO AACTQAOCTO AACTOAATOO AAATOAGOTA 9660

TACAGCAAAG TGGOGATTTT TTAOATAATA AOAATATACA CATAACATAO TGTATACTCA 9720

TATTTTTATG CATACCTOAA TOCTCAOTCA CTCAOTCOTA TCTGACTCTG TOACCTATOO 9780




ACCOTAOCCT TCCAGOTTTC TTCTOTCCAC AGAATTCTCC AAGGCAAGAA TACTGGAGTG 9840

OOTAOCCATT TCCTCCTCCA GOOGATCCTC CCOACCCAGO OATTGAACCG OCATCTCCTO 9900

TATTGGCAOG TGGATTCTTT ACCACTOTOC CACCAGOGAA OCCCGTOTTA CTCTCTATOT 9960

CCCACTTAAT TACCAAAOCT OCTCCAAOAA AAAOCCCCTG TOCCCTCTOA OCTTCCCOOC 10020

CTOCAOAOGO TGGTGGGGOT AOACTGTOAC CTOOOAACAC CCTCCCOCTT CAOGACTCCC 10080

GOOCCACOTG ACCCACAGTC CTOCAGACAG CCOOOTAOCT CTGCTCTTCA AGOCTCATTA 10140

TCTTTAAAAA AAACTGAOGT CTATTTTOTG ACTICOCTGC CGTAACTTCT OAACATCCAO 10200

TOCOATOOAC AOOACCTCCT CCCCAOGCCT CAOOOOCTTC AGOOAGCCAO CCTTCACCTA 10260

TOAGTCACCA GACACTCOOG OOTOOCCCCO CCTTCAOOGT OCTCACAGTC TTCCCATCGT 10320

CCTGATCAAA GAGCAAGACC AATOACTTCT TAGOAGCAAG CAGACACCCA CAGGACACTO 10380

AGGTTCACCA OAOCTGAGCT OTCCTTTTGA ACCTAAAOAC ACACAOCTCT CGAAGGTTTT 10440

CTCTTTAATC TGGATTTAAG GCCTACTTGC CCCTCAAOAG GOAAOACAOT CCTOCATGTC 10500

CCCAGOACAG CCACTCOOTO OCATCCGAGG CCACTTAGTA TTATCTOACC GCACCCTGGA 10560

ATTAATCOOT CCAAACTOOA CAAAAACCTT OGTOOOAAGT TTCATCCCAG AOGCCTCAAC 10620

CATCCTOCTT TOACCACCCT GCATCTTTTT TTCTTTTATO TOTATGCATO TATATATATA 10680

TATATATTTT TTTTTTTTTC ATTTTTTOGC TOTOCTGGCT GTTCGTTOCA OTTCGGTOCO 10740

CAOOCTTCTC TCTAOTTTCT CTCTAOTCTT CTCTTATCAC AGAGCAGTCT CTAOACOATC 10800

OACOCGT 10807

( 2 ) INFORMATION FOR SEQ ID NO:8:

( i ) SEQUENCE CHARACTERISTICS:
( A ) LENGTH: 47 base pairs
( B ) TYPE: nucleic acid
( C ) STRANDEDNESS: single
( D ) TOPOLOGY: linear

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:8:

AATTCCGATC GACGCGTCGA CGATATACTC TAGACGATCG ACGCGTA 47

( 2 ) INFORMATION FOR SEQ ID NO:9:

( i ) SEQUENCE CHARACTERISTICS:
( A ) LENGTH: 24 base pairs
( B ) TYPE: nucleic acid
( C ) STRANDEDNESS: single
( D ) TOPOLOGY: linear

( v i i ) IMMEDIATE SOURCE:

( B ) CLONE: BLGAMP3

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:9:

TGGATCCCCT GCCGGTGCCT CTGG 24

( 2 ) INFORMATION FOR SEQ ID NO:10:

( i ) SEQUENCE CHARACTERISTICS:
( A ) LENGTH: 24 base pairs
( B ) TYPE: nucleic acid
( C ) STRANDEDNESS: single
( D ) TOPOLOGY: linear

( v i i ) IMMEDIATE SOURCE:

( B ) CLONE: BLGAMP4

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:10:

AACGCGTCAT CCTCTGTGAG CCAG 24
( 2 ) INFORMATION FOR SEQ ID NO:11:

( i ) SEQUENCE CHARACTERISTICS:
( A ) LENGTH: 10 base pairs
( B ) TYPE: nucleic acid
( C ) STRANDEDNESS: single
( D ) TOPOLOGY: linear

( v i i ) IMMEDIATE SOURCE:

( B ) CLONE: ZC6839

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:11:

ACTACGTAGT

( 2 ) INFORMATION FOR SEQ ID NO:12:

( i ) SEQUENCE CHARACTERISTICS:
( A ) LENGTH: 42 base pairs
( B ) TYPE: nucleic acid
( C ) STRANDEDNESS: single
( D ) TOPOLOGY: linear

( v i i ) IMMEDIATE SOURCE:

( B ) CLONE: ZC6632

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:12:

CGACGCGGAT CCTACGTACC TGCACCCATG TTTTCCATGA GG

( 2 ) INFORMATION FOR SEQ ID NO:13:

( i ) SEQUENCE CHARACTERISTICS:
( A ) LENGTH: 21 base pairs
( B ) TYPE: nucleic acid
( C ) STRANDEDNESS: single
( D ) TOPOLOGY: linear

( v i i ) IMMEDIATE SOURCE:

( B ) CLONE: ZC6627

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:13:

AGGGCTTCGG CAAGCTTCAC G

( 2 ) INFORMATION FOR SEQ ID NO:14:

( i ) SEQUENCE CHARACTERISTICS:
( A ) LENGTH: 24 base pairs
( B ) TYPE: nucleic acid
( C ) STRANDEDNESS: single
( D ) TOPOLOGY: linear

( v i i ) IMMEDIATE SOURCE:

( B ) CLONE: ZC6521

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:14:

GCCAAAGACT TACTTCCCTC TAGA

( 2 ) INFORMATION FOR SEQ ID NO:15:

( i ) SEQUENCE CHARACTERISTICS:
( A ) LENGTH: 30 base pairs
( B ) TYPE: nucleic acid
( C ) STRANDEDNESS: single
( D ) TOPOLOGY: linear

( v i i ) IMMEDIATE SOURCE:

( B ) CLONE: ZC6520

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:15:

GCATGAACGT CGCGTGGTGG TTGTGCTACC 30
( 2 ) INFORMATION FOR SEQ ID NO:16:

( i ) SEQUENCE CHARACTERISTICS:
( A ) LENGTH: 30 base pairs
( B ) TYPE: nucleic acid
( C ) STRANDEDNESS: single
( D ) TOPOLOGY: linear

( v i i ) IMMEDIATE SOURCE:

( B ) CLONE: ZC6519

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:16:

ACCACGCGAC GTTCATGCTC TAAAACCGTT

( 2 ) INFORMATION FOR SEQ ID NO:17:

( i ) SEQUENCE CHARACTERISTICS:
( A ) LENGTH: 36 base pairs
( B ) TYPE: nucleic acid
( C ) STRANDEDNESS: single
( D ) TOPOLOGY: linear

( v i i ) IMMEDIATE SOURCE:

( B ) CLONE: ZC6518

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:17:

GCTGCGGGAT CCTACGTACT AGGGGGACAG GGAAGG 36

( 2 ) INFORMATION FOR SEQ ID NO:18:

( i ) SEQUENCE CHARACTERISTICS:
( A ) LENGTH: 45 base pairs
( B ) TYPE: nucleic acid
( C ) STRANDEDNESS: single
( D ) TOPOLOGY: linear

( v i i ) IMMEDIATE SOURCE:

( B ) CLONE: ZC6629

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:18:

CGACGCGAAT TCTACGTACC TGCAGCCATG AAAAGGATGG TTTCT 45

( 2 ) INFORMATION FOR SEQ ID NO:19:

( i ) SEQUENCE CHARACTERISTICS:
( A ) LENGTH: 45 base pairs
( B ) TYPE: nucleic acid
( C ) STRANDEDNESS: single
( D ) TOPOLOGY: linear

( v i i ) IMMEDIATE SOURCE:

( B ) CLONE: ZC6630

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:19:

CGACGCGAAT TCTACGTACC TGCAGCCATG AAACATCTAT TATTG 45

( 2 ) INFORMATION FOR SEQ ID NO:20:

( i ) SEQUENCE CHARACTERISTICS:
( A ) LENGTH: 21 base pairs
( B ) TYPE: nucleic acid
( C ) STRANDEDNESS: single
( D ) TOPOLOGY: linear

( v i i ) IMMEDIATE SOURCE:

( B ) CLONE: ZC6625

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:20:

GTGAGATTTT CAGATCTTGT C
( 2 ) INFORMATION FOR SEQ ID NO:21:

( i ) SEQUENCE CHARACTERISTICS:
( A ) LENGTH: 21 base pairs
( B ) TYPE: nucleic acid
( C ) STRANDEDNESS: single
( D ) TOPOLOGY: linear

( v i i ) IMMEDIATE SOURCE:

( B ) CLONE: ZC6626

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:21:



( 2 ) INFORMATION FOR SEQ ID NO:22:

( i ) SEQUENCE CHARACTERISTICS:
( A ) LENGTH: 33 base pairs
( B ) TYPE: nucleic acid
( C ) STRANDEDNESS: single
( D ) TOPOLOGY: linear

( v i i ) IMMEDIATE SOURCE:

( B ) CLONE: ZC6624

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:22:

GCTGCGGAAT TCTACGTACT ATTGCTGTGG GAA
( 2 ) INFORMATION FOR SEQ ID NO:23:

( i ) SEQUENCE CHARACTERISTICS:
( A ) LENGTH: 45 base pairs
( B ) TYPE: nucleic acid
( C ) STRANDEDNESS: single
( D ) TOPOLOGY: linear

( v i i ) IMMEDIATE SOURCE:

( B ) CLONE: 2C65W

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:23:

CGACGCGGAT CCTACGTACC TGCAGCCATG AGTTGGTCCT TGCAC

( 2 ) INFORMATION FOR SEQ ID NO:24:

( i ) SEQUENCE CHARACTERISTICS:
( A ) LENGTH: 21 base pairs
( B ) TYPE: nucleic acid
( C ) STRANDEDNESS: single
( D ) TOPOLOGY: linear

( v i i ) IMMEDIATE SOURCE:

( B ) CLONE: ZC6517

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:24:

GTCTCTGGTA GCAACATACT A



( 2 ) INFORMATION FOR SEQ ID NO:25:

( i ) SEQUENCE CHARACTERISTICS:
( A ) LENGTH: 22 base pairs
( B ) TYPE: nucleic acid
( C ) STRANDEDNESS: single
( D ) TOPOLOGY: linear

( v i i ) IMMEDIATE SOURCE:

( B ) CLONE: ZC6516

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:25:

GGGTTTCTAG CCCTACTAGT AG
( 2 ) INFORMATION FOR SEQ ID NO:26:

( i ) SEQUENCE CHARACTERISTICS:
( A ) LENGTH: 22 base pairs
( B ) TYPE: nucleic acid
( C ) STRANDEDNESS: single
( D ) TOPOLOGY: linear

( v i i ) IMMEDIATE SOURCE:

( B ) CLONE: ZC6515

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:26:

GGGTTTCTAG CCCTACTAGT AG 22

( 2 ) INFORMATION FOR SEQ ID NO:27:

( i ) SEQUENCE CHARACTERISTICS:
( A ) LENGTH: 47 base pairs
( B ) TYPE: nucleic acid
( C ) STRANDEDNESS: single
( D ) TOPOLOGY: linear

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:27:

AAGCTACGCG TCGATCGTCT AGAGTATATC GTCGACGCGT CGATCGG 47



We claim: 

1. A method for producing biocompetent fibrinogen comprising:

providing a first DNA segment encoding a secretion signal operably
linked to a heterologous fibrinogen Aα chain, the DNA segment
comprising genomic DNA encoding the Aα chain; a second DNA segment
encoding a secretion signal operably linked to a heterologous
fibrinogen Bβ chain, the DNA segment comprising genomic DNA encoding
the Bβ chain; and a third DNA segment encoding a secretion signal
operably linked to a heterologous fibrinogen γ chain, the DNA segment
comprising genomic DNA encoding the γ chain, wherein each chain is
from the same species, and wherein each of said first, second and
third segments is operably linked to additional DNA segments required
for its expression in the mammary gland of a host female mammal;

introducing said DNA segments into a fertilized egg of a non-human
mammalian species heterologous to the species of origin of said
fibrinogen chains;

inserting said egg into an oviduct or uterus of a female of said
mammalian species to obtain offspring carrying said DNA segments;

breeding said offspring to produce female progeny that express said
first, second and third DNA segments and produce milk containing
biocompetent fibrinogen encoded by said segments;

collecting milk from said female progeny; and

recovering the biocompetent fibrinogen from the milk.

2. A method according to claim 1 wherein said species into which said
DNA segments are introduced is selected from the group consisting of
sheep, pigs, goats, and cattle.

3. A method according to claim 1 wherein each of said first, second
and third DNA segments comprises an intron.

4. A method according to claim 1 wherein the molar ratio of said
first, second and third DNA segments is within the range of
0.5-1:0.5-1:0.5-1.

5. A method according to claim 1 wherein each of said first, second
and third DNA segments is operably linked to a transcription promoter
selected from the group consisting of casein, β-lactoglobulin,
α-lactalbumin and whey acidic protein gene promoters.

6. A method according to claim 1 wherein said first, second and third
DNA segments are expressed under the control of a β-lactoglobulin
promoter.

7. A method according to claim 1 wherein said introducing step
comprises injecting said first, second and third DNA segments into a
pronucleus of said fertilized egg.

8. A method according to claim 1 wherein said fibrinogen is human
fibrinogen.

9. A method according to claim 1 wherein said second DNA segment
comprises a sequence of nucleotides as shown in SEQ ID NO: 3 from
nucleotide 470 to nucleotide 8100.

10. A method according to claim 1 wherein said second DNA segment
comprises a sequence of nucleotides as shown in SEQ ID NO: 3 from
nucleotide 512 to nucleotide 8100.

11. A method according to claim 1 wherein said species into which
said DNA segments is introduced is sheep.

12. A method of producing biocompetent fibrinogen comprising:

incorporating a first DNA segment encoding a secretion signal
operably linked to an Aα chain of fibrinogen into a β-lactoglobulin
gene to produce a first gene fusion comprising a β-lactoglobulin
promoter operably linked to the first DNA segment, the DNA segment
comprising genomic DNA encoding the Aα chain;

incorporating a second DNA segment encoding a secretion signal
operably linked to a Bβ chain of fibrinogen into a β-lactoglobulin
gene to produce a second gene fusion comprising a β-lactoglobulin
promoter operably linked to the second DNA segment, the DNA segment
comprising genomic DNA encoding the Bβ chain;

incorporating a third DNA segment encoding a secretion signal
operably linked to a γ chain of fibrinogen into a β-lactoglobulin
gene to produce a third gene fusion comprising a β-lactoglobulin
promoter operably linked to the third DNA segment, the DNA segment
comprising genomic DNA encoding the γ chain, wherein each of said
first, second and third segments are of the same species;

introducing said first, second and third gene fusions into the germ
line of a non-human mammal so that said DNA segments are expressed in
a mammary gland of said mammal or its female progeny and biocompetent
fibrinogen is secreted into milk of said mammal or its female
progeny;

obtaining milk from said mammal or its female progeny; and

recovering said fibrinogen from said milk.

13. A method according to claim 12 wherein said mammal is a sheep,
pig, goat or cow.

14. A method according to claim 12 wherein each of said first, second
and third gene fusions comprises an intron.

15. A method according to claim 12 wherein the molar ratio of said
first, second and third gene fusions introduced is within the range
of 0.5-1:0.5-1:0.5-1.

16. A method according to claim 12 wherein said introducing step
comprises injecting said first, second and third gene fusions into a
pronucleus of a fertilized egg and inserting said egg into an oviduct
of a pseudopregnant female to produce female offspring carrying said
gene fusions in the germ line, wherein said egg and said
pseudopregnant female are of the same species.

17. A method according to claim 12 wherein said mammal is a sheep.

18. A method for producing biocompetent fibrinogen comprising:

providing a transgenic female non-human mammal carrying in its
germline heterologous genomic DNA segments encoding Aα, Bβ and γ
chains of fibrinogen, wherein said segments are expressed in a
mammary gland of said mammal and biocompetent fibrinogen encoded by
said segments is secreted into milk of said mammal;

collecting milk from said mammal; and

recovering said biocompetent fibrinogen from said milk.

19. A method according to claim 18 wherein said mammal 
is a sheep, pig, goat or cow. 

20. A method according to claim 18 wherein said mammal 
is a sheep. 

21. A transgenic non-human female mammal that produces recoverable
amounts of biocompetent human fibrinogen in its milk, wherein said
mammal comprises:

a first DNA segment encoding a secretion signal operably linked to a
heterologous fibrinogen Aα chain, the DNA segment comprising
heterologous genomic DNA encoding the Aα chain;

a second DNA segment encoding a secretion signal operably linked to a
heterologous fibrinogen Bβ chain, the DNA segment comprising
heterologous genomic DNA encoding the Bβ chain; and

a third DNA segment encoding a secretion signal operably linked to a
heterologous fibrinogen γ chain, the DNA segment comprising
heterologous genomic DNA encoding the γ chain,

further wherein each chain is derived from the same species and is
operably linked to additional DNA segments required for its
expression in the mammary gland of a host female mammal.

22. A mammal according to claim 21 wherein said mammal is a sheep.

23. A process for producing a transgenic offspring of a mammal
comprising:

providing a first DNA segment encoding a secretion signal operably
linked to a heterologous fibrinogen Aα chain, the DNA segment
comprising genomic DNA encoding the Aα chain; a second DNA segment
encoding a secretion signal operably linked to a heterologous
fibrinogen Bβ chain, the DNA segment comprising genomic DNA encoding
the Bβ chain; and a third DNA segment encoding a secretion signal
operably linked to a heterologous fibrinogen γ chain, the DNA segment
comprising genomic DNA encoding the γ chain, wherein each chain is
derived from the same species, and wherein each of said first, second
and third segments is operably linked to additional DNA segments
required for its expression in the mammary gland of a host female
mammal;

introducing said DNA segments into a fertilized egg of a non-human
mammalian species heterologous to the species of origin of said
fibrinogen chains;

inserting said fertilized egg into an oviduct or uterus of a female
of said mammalian species; and

allowing said fertilized egg to develop thereby producing transgenic
offspring carrying said first, second and third DNA segments, wherein
female progeny of said mammal express said DNA segments in a mammary
gland to produce biocompetent fibrinogen.

24. A process according to claim 23 wherein said offspring is female.

25. A process according to claim 23 wherein said offspring is male.

26. A non-human mammal produced according to the process of claim 23.

27. A non-human mammal according to claim 26 wherein said mammal is
female.

28. A non-human female mammal according to claim 27 that produces
milk containing biocompetent fibrinogen encoded by said DNA segments.

29. A non-human mammal according to claim 26 wherein said mammal is
male.

30. A non-human mammal carrying in its germline heterologous genomic
DNA segments encoding human Aα, Bβ and γ chains of fibrinogen,
wherein female progeny of said mammal express said DNA segments in a
mammary gland to produce biocompetent human fibrinogen.

31. A non-human mammal according to claim 30 wherein said mammal is
female.

32. A non-human mammal according to claim 30 wherein said mammal is
male.

33. A mammal according to claim 30 wherein said mammal is a sheep.






34. A set of DNA sequences comprising:

a first DNA segment encoding a secretion signal operably linked to a
heterologous fibrinogen Aα chain, the DNA segment comprising genomic
DNA encoding the Aα chain;

a second DNA segment encoding a secretion signal operably linked to a
heterologous fibrinogen Bβ chain, the DNA segment comprising genomic
DNA encoding the Bβ chain; and

a third DNA segment encoding a secretion signal operably linked to a
heterologous fibrinogen γ chain, the DNA segment comprising genomic
DNA encoding the γ chain, wherein each chain is from the same
species, and wherein each of said first, second and third segments is
operably linked to additional DNA segments required for its
expression in the mammary gland of a host female mammal.
IN THE UNITED STATES PATENT AND TRADEMARK OFFICE

Reissue Application : Not yet assigned
Serial No. : Not yet assigned
Filed : January 15, 1999
Patent : 5,639,940
Patentee : Ian Garner, Michael A. Dalrymple,* Donna E. Prunkard and Donald C. Foster
Assignee : Pharmaceutical Proteins, Ltd., and ZymoGenetics, Inc.
Issued : June 17, 1997
Application : 08/206,176
Filed : March 3, 1994
For : PRODUCTION OF FIBRINOGEN IN TRANSGENIC ANIMALS


REISSUE DECLARATION AND POWER OF ATTORNEY 

We, DONNA E. PRUNKARD and DONALD C. FOSTER, two of the four
named inventors of United States patent 5,639,940 ("the '940 patent"), and applicants for
reissue thereof, declare that:

1. We are citizens of the United States and have residences and post office
addresses as stated below under our respective names:

Donna E. Prunkard
1463 N.W. 92nd St.
Seattle, WA 98117

Donald C. Foster
3002 NE 181st Street
Lake Forest Park, WA 98155

* The '940 patent incorrectly printed the middle initial of Dr. Dalrymple as "L".

EXPRESS MAIL EI187448225US

2. By concurrent Petition Under 37 C.F.R. § 1.324 and Motion Under 37
C.F.R. § 1.634, the four originally named inventors, Ian Garner, Michael A. Dalrymple, Donna
E. Prunkard, and Donald C. Foster, have petitioned and moved to correct the inventorship for
all of the claims of the '940 patent. As amended, Donna E. Prunkard and Donald C. Foster are
the joint inventors of the subject matter claimed in the '940 patent.

3. We have reviewed and understand the contents of the '940 patent and this
application for reissue thereof, including their claims, and believe that we are the original, first
and joint inventors of the invention described and claimed in the '940 patent to the extent that a
reissue application is being sought on that invention.

4. We make this declaration in support of this application for reissue of the
'940 patent and acknowledge our duty to disclose to the United States Patent and Trademark
Office all information known to us to be material to the patentability of the reissue application
and its claims as defined in 37 C.F.R. § 1.56(a).

5. We do not know and do not believe that our invention was ever patented
or described in any printed publication in any country before our invention thereof or more than
one year prior to the March 3, 1994 filing date of our United States patent application
08/206,176, which issued as the '940 patent, or in public use or on sale in the United States
more than one year prior to that filing date.

6. We believe that the '940 patent is "wholly or partly invalid" because of
error, without deceptive intent, by reason of our having claimed more than we had a right to
claim in view of the prior art. Specifically, all of the claims of the '940 patent recite methods and
transgenic animals employing "DNA" encoding, respectively, each of the Aα, Bβ and γ chains
of fibrinogen. To be patentable over the prior art, we believe that these claims should be limited
to genomic DNA and thereby exclude cDNA.

7. In obtaining allowance of the claims of the '940 patent, we understand
that our attorney argued that the claims were patentable over the prior art based on the
unexpected production of useful amounts of biologically active fibrinogen in the milk of
transgenic animals. See Amendment, May 8, 1995, pp. 9-12. We understand that at the request
of the Examiner and in view of that argument, our attorney then amended the claims to recite
"biocompetent fibrinogen". See Office Action, Paper No. 13, May 21, 1996, p. 4; Amendment,
October 1, 1996, pp. 1-2, 6-8. It was this amendment that we understood led to allowance of
the claims and grant of the '940 patent.

8. We do not believe that such amendment renders the claims of the '940
patent patentable over the prior art. We believe at the time we made our invention, in view of
art published more than one year before the filing date of the application that issued as the
'940 patent, that it would have been obvious to the skilled worker in the recombinant fibrinogen
expression and transgenic arts that biocompetent fibrinogen would be produced in the milk of
transgenic animals carrying the cDNAs encoding the Aα, Bβ and γ chains of fibrinogen at about
the same level on a per cell basis as had already been produced in mammalian cells transfected
with those cDNAs, i.e., about 5 µg/mg of cellular protein/day. Indeed, we understood that when
such cDNAs were actually used to produce transgenic mice, those mice produced in their milk
biocompetent fibrinogen at about that expected level.

9. We also believe that at the time we made our invention, in view of prior
art published more than one year before we filed our application, that the skilled worker would
have reasonably expected that genomic DNAs encoding the Aα, Bβ and γ chains of fibrinogen
could be used to produce biocompetent fibrinogen in the milk of transgenic animals in the best
case at that same level and potentially at much lower levels. We believe, however, that at the
time we made our invention, in view of prior art published more than one year before we filed
our application, it would not have been obvious to that skilled worker that those genomic DNAs
could be used to produce biocompetent fibrinogen in the milk of transgenic animals with a
reasonable expectation of success at levels on a per cell basis higher than those already produced
in mammalian cells transfected with cDNAs encoding the three fibrinogen chains. It was to this
unexpectedly high expression level that we understood our attorney was referring when he
argued to the Examiner that our invention was patentable because it unexpectedly produced
useful amounts of biocompetent fibrinogen.

10. We believe that the error described in paragraphs 7-9 above arose
without deceptive intent. All of the examples in our application used genomic DNA encoding
the Aα, Bβ and γ chains of human fibrinogen. We understood that when our attorney made the
above arguments to the Examiner and amended our claims to obtain their allowance he had in
mind the genomic DNA as used in our Examples. See the '940 patent, Examples II, III, and IV.
We understood that he failed to consider that the claims literally included cDNA.

11. We also believe that the '940 patent is also "wholly or partly inoperative"
because of error, without deceptive intent, by reason of our having claimed less than we had a
right to claim in view of the disclosure of the application that issued as the '940 patent.
Specifically, none of the patent claims is directed to a set comprising a first, second, and third
DNA segment encoding a secretion signal operably linked to the heterologous fibrinogen Aα,
Bβ and γ chains, respectively, each of the DNA segments comprising genomic DNA encoding
the respective fibrinogen chain, and wherein each of the chains is from the same species and is
operably linked to additional DNA segments required for expression in the mammary glands of a
host female mammal. This set of DNA segments is useful in the methods of claims 1-20 and
23-25 of the '940 patent and in producing the non-human mammals of claims 21-22 and 26-33
of the '940 patent. This set of DNA segments is also described and enabled in the '940 patent
and should have been claimed in that patent and in the application from which it issued. Without
a claim to this set of DNA segments, we understand that a third party might potentially prepare
the recited set of DNA segments in the United States and export that set from the United States
for use outside the United States in the production of fibrinogen in the milk of transgenic
animals. We also understand that without reissue such acts would not infringe the claims of the
'940 patent.

12. We understand and believe that the error described in paragraph 11 arose 
without deceptive intent. 

13. We understand and believe that these errors in the claims of the '940 
patent were discovered while reviewing the '940 patent, with counsel, in connection with Garner 
v. Velander, Interference 104,242. We have filed this reissue application promptly after 
recognizing the errors. 

14. As named inventors, we hereby appoint the following attorneys and 
agents in connection with the '940 patent and this application for reissue thereof, with full power 
to prosecute this application for reissue and to transact all business in the United States Patent 
and Trademark Office in connection therewith and with the '940 patent: 

James F. Haley (Reg. No. 27,794) 
Karen Mangesarian (Reg. No. 43,772) 
Z. Ying Li (Reg. No. 42,800) 

all of Fish & Neave, 1251 Avenue of the Americas, 49th Floor, New York, NY 10020 

Send correspondence to: James F. Haley, Jr. 

1251 Avenue of the Americas 
New York, New York 10020-1104 

Direct telephone calls to: James F. Haley 

(212) 596-9000 

15. We hereby declare that we understand the English language, and that all 
statements made herein of our own knowledge are true and that all statements made on 
information and belief are believed to be true; and further that these statements are made with 
the knowledge that willful false statements and the like so made are punishable by fine or 
imprisonment, or both, under Section 1001 of Title 18 of the United States Code and that such 
willful false statements may jeopardize the validity of this application for reissue or any Patent 
issued thereon. 




DONNA E. PRUNKARD DONALD C. FOSTER 

Date: /-/3-?f Date: V - -<W 

DECLARATION IN SUPPORT OF REISSUE APPLICATION 



I, GARY E. PARKER, declare that: 

1. I make this declaration in support of the accompanying Reissue 
Application for United States patent 5,639,940. 

2. I am Principal Patent Agent at ZymoGenetics, Inc., Seattle, 
Washington. In March 1994, I was the Manager of the Patent Department at 
ZymoGenetics. I am registered to practice before the United States Patent and Trademark 
Office. I am not an attorney. 



The '940 patent incorrectly printed the middle initial of Dr. Dalrymple as "L". 



EXPRESS MAIL 
EI187448225US 






3. I prepared and filed the '176 application that issued as the above- 
identified '940 patent. 

4. I understand and believe that the '940 patent is "wholly or partly 
invalid" because of error, without deceptive intent. I believe that the patent claims more 
than the inventors had a right to claim in view of the prior art. Particularly, all of the 
claims of the '940 patent recite methods and transgenic animals employing "DNA" 
encoding, respectively, each of the Aα, Bβ and γ chains of fibrinogen. I believe that these 
claims should be limited to genomic DNA, and thereby exclude cDNA, to be patentable 
over the prior art. 

5. In obtaining allowance of the claims of the '940 patent over the 
prior art, I argued that the claims were patentable because production of useful amounts 
of biologically active fibrinogen in the milk of transgenic animals was unexpected. See 
Amendment, May 8, 1995, pp. 9-12. At the request of the Examiner and in view of that 
argument, I amended the claims to recite "biocompetent fibrinogen". See Office Action, 
Paper No. 13, May 21, 1996, p. 4; Amendment, October 1, 1996, pp. 1-2, 6-8. I believe 
that it was this amendment that led to allowance of the claims and grant of the '940 
patent. 

6. I do not now believe that my amendment renders the claims of the 
'940 patent patentable. I understand and believe that at the time the inventors made their 
invention, in view of art published more than one year before the filing date of the 
application that issued as the '940 patent, it would have been obvious to the skilled 
worker in the recombinant fibrinogen expression and transgenic arts that biocompetent 
human fibrinogen would be produced in the milk of transgenic animals carrying the 
cDNAs encoding the Aα, Bβ and γ chains of human fibrinogen at about the same level on 
a per cell basis as had already been produced in mammalian cells transfected with those 
cDNAs, i.e., about 2 µg/2×10⁶ cells/day.* In fact, when cDNAs encoding the three chains 
of fibrinogen were used to produce transgenic mice, those mice produced biocompetent 
fibrinogen in their milk at about the expected level. 

7. I also understand and believe that at the time of invention, in view 
of prior art published more than one year before the application was filed, it would 
not have been obvious to that skilled worker that genomic DNAs encoding the Aα, Bβ 
and γ chains of fibrinogen could be successfully used to produce biocompetent fibrinogen 
in the milk of transgenic animals at levels on a per cell basis higher than those already 
produced in mammalian cells using the corresponding cDNAs. It was to this unexpectedly 
higher production of biocompetent fibrinogen that I referred when I argued that the 
Garner claims were patentable. See, supra, ¶ 5. 

8. I believe that the error described in paragraphs 4-7 above arose 
without deceptive intent. Although I do not recall the specific details of the prosecution 
of the application that issued as the '940 patent, I believe that during prosecution I did not 
consider that the claims covered low level expression of cDNA. All of the examples in the 
application used genomic DNA encoding the Aα, Bβ and γ chains of human fibrinogen. 
See the '940 patent, Examples II, III, and IV. And, it was only genomic DNA that the 
inventors had used to produce useful amounts of biocompetent fibrinogen in the milk of 
transgenic animals. It was this work that was the basis of my patentability arguments. As 
best I can recollect, I failed to consider or realize that the pending claims literally included 
cDNA. 

* During prosecution of the application that issued as the Garner '940 patent, the 
Examiner cited Roy et al. for its report of the production of 2 µg of fibrinogen per 
2×10⁶ cells/day in COS cells. 

9. I also understand and believe that the '940 patent is also "wholly or 
partly inoperative" because of error, without deceptive intent. I believe that the patent 
claims less than the inventors had a right to claim in view of the disclosure of the 
application that issued as the '940 patent. None of the patent claims is directed to a set 
comprising a first, second, and third DNA segment encoding a secretion signal operably 
linked to the heterologous fibrinogen Aα, Bβ and γ chains, respectively, each of the DNA 
segments comprising genomic DNA encoding the respective fibrinogen chain, and wherein 
each of the chains is from the same species and is operably linked to additional DNA 
segments required for expression in the mammary glands of a host female mammal. This 
set of DNA segments is useful in the methods of claims 1-20 and 23-25 of the '940 patent 
and in producing the non-human mammals of claims 21-22 and 26-33 of the '940 patent. 
I also believe that this set of DNA segments is described and enabled in the '940 patent. 
Finally, I believe that the set of DNA segments should have been claimed in the 
'940 patent and in the application from which it issued. Without a claim to this set of 
DNA segments, I understand and believe that a third party might potentially prepare the 
recited set of DNA segments in the United States and export that set from the 
United States for use outside the United States in the production of fibrinogen in the milk 
of transgenic animals. I also understand and believe that without reissue such acts would 
not infringe the claims of the '940 patent. 

10. The error described in paragraph 11 arose without deceptive intent. 
Today, I do not recall the specific details of the prosecution of the patent application that 
issued as the '940 patent. Nonetheless, to the best of my recollection, I believe that I did 
not claim the set of DNA segments now claimed in reissue claim 34, because I was 
focusing on the end product - fibrinogen - and the methods and animals used to make it. 
I did not, as I recall, consider claiming the various intermediates - including this set of 
DNA segments - used in producing those animals or the claimed method. 

11. I discovered these errors (paragraphs 4-7 and 8-10) in the claims of 
the '940 patent while reviewing the '940 patent, with counsel, in connection with Garner 
v. Velander, Interference 104,242. 

12. I hereby declare that I understand the English language, and that all 
statements made herein of my knowledge are true and that all statements made on 
information and belief are believed to be true; and further that these statements are made 
with the knowledge that willful false statements and the like so made are punishable by 
fine or imprisonment, or both, under Section 1001 of Title 18 of the United States Code 
and that such willful false statements may jeopardize the validity of this application for 
reissue or any Patent issued thereon. 